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DETECTION OF MOLECULAR INTERACTIONS BY 
REPORTER SUBUNIT COMPLEMENTATION 



CROSS-REFERENCE TO RELATED APPLICATIONS 
This application claims priority to U.S. Provisional Patent Applications Serial No. 
60/042,576 filed April 2, 1997 and Serial No. 60/054,638, filed August 4, 1997; and U.S. 
Patent Application Attorney Docket No. 28600-20206.00, filed April 1 , 1 998. 

STATEMENT OF RIGHTS TO INVENTIONS MADE 
UNDER FEDERALLY SPONSORED RESEARCH 
Not applicable. 

TECHNICAL FIELD 
This invention is in the field of molecular biology and, more specifically, in the 
field of reporter systems useful for the analysis of protein-protein interactions. 

BACKGROUND 

The P-galactosidase enzyme (P-gal), the protein product of the E. coli lacZ gene, is 
widely used in studies of gene expression and cell lineage in higher organisms. Several 
biochemical assays of p-gal activity, including live-cell flow cytometry and histochemical 
staining with the chromogenic substrate 5-bromo-4-chloro-3-indolyl 
P-D-galactopyranoside (X-gal) make the product of the /acZ gene extremely versatile as a 
quantitative reporter enzyme, selectable marker, or histological indicator. Bronstein et al. 
(1989)^ Biolumin. Chemilumm. 4:99-111; Nolan e/ a/. (1988) Proc. Natl Acad. Sci. USA 
85:2603-2607; and Lojda (1979) Enzyme Histochemistry: A Laboratory Manual, Springer, 
Berlin. One property of the lacZ system that has been well-characterized in studies of 
bacterial genetics, but has not been exploited in eukaiyotes is the phenomenon of 
intracistronic complementation. Studies in E. coli have shown that deletions of p-gal 
which remove portions of either the N-tenninus or the C-tertninus produce enzyme which 
is inactive. However, coexpression of one of these deletion mutants with a second inactive 
deletion mutant containing domains that are lacking in the first can restore p-gal enzymatic 
activity in a process called complementation. This complemented p-gal activity arises by 



concentration-dependent assembly of a stable hetero-octameric enzyme complex 
comprising all the essential domains of the wild-type homotetramer. Ullman et al (1965) 
J. Mol Biol 12:918-923; Ullman et al (1967) 1 Mol Biol 24:339-343; and Ullman et 
al (1968) J. Mol Biol 32:1-13. 

A system utilizing p-gal complementation in enzyme assays has been described. 
Henderson, U.S. Patent 4,708,929. In this system, enzymatically inactive p-gal 
polypeptide fragments, capable of combining with high affinity to form active p-gal by 
complementation, are used. One of the fragments is conjugated to analyte, which allows it 
to compete with analyte for binding to an analyte-binding protein. If bound to the analyte- 
binding protein, the p-gal fragment is unable to complement. Thus, by comparing P-gal 
activity in the presence of sample to that obtained in the presence of a known concentration 
of analyte (at equal concentrations of analyte-binding protein) the amount of analyte in the 
sample may be determined. This method requires high-affinity complementing subunits of 
p-gal, requires that an analyte-binding protein be known, and is not applicable to single- 
cell analysis. . 

Previous systems for the study of protein-protein interactions have been described 
which utilize two fusion genes whose products reconstitute the function of a transcriptional 
activator. Fields et al, (1989) Nature 340:245-247; Bai et al, (1996) Metk Enzymol 
273:33 1-347; Luo et al, (1997) BioTechniques 22(2) ;350-352. In one fusion gene, a 
sequence encoding a first protein is conjugated to a sequence encoding a DNA-binding 
domain of a transcriptional regulatory protein. In a second fusion gene, a sequence • 
encoding a second protein is conjugated to a sequence encoding a transcriptional activation 
domain of a transcriptional regulatory protein. The two fusion genes are co-transfected 
into a cell which also contains a reporter gene whose expression is controlled by a DNA 
regulatory sequence that is bound by the DNA-binding domain encoded by the first fusion 
gene. Expression of the reporter gene requires that a transcriptional activation domain be 
brought adjacent to the DNA regtdatory sequence. Binding of the first protein to the 
second protein will bring the transcriptional activation domain encoded by the second 
fusion gene into proximity with the DNA-binding domain encoded by the first fusion gene, 
thereby stimulating transcription of the reporter gene. Thus, the level of expression of the 
reporter gene will reflect the degree of binding between the first and second proteins. 



There are several disadvantages associated with the use of the above-mentioned 
system. As it is dependent upon transcriptionally-regulated expression of a reporter gene, 
this system is limited to the assay of interactions that take place in the nucleus. In 
addition, the assay is indirect, relying on transcriptional activation of a reporter gene whose 
product is diffusible. Hence, a method which would allow a direct and immediate 
examination of molecular interactions, at the site where they occur, would be desirable. 

A system for detecting protein-protein interactions, not limited to nuclear 
interactions, has been described. U.S. Patent Nos. 5,503,977 and 5,585,245. In this 
system, fusions between potential interacting polypeptides and mutant subunits of the 
protein ubiquitin are formed. Juxtaposition of the two ubiquitin subunits brought about by 
interaction between potential interacting polypeptides creates a substrate for a ubiquitin- 
specific protease, and a small peptide reporter fragment is released. In this system, binding 
between the potential interacting polypeptides does not generate any type of enzymatic 
activity; therefore, signal amplification is not possible. Additionally, the ubiquitin system 
does not measure activity in intact cells, but relies on assays of proteolysis in cell-free" 
extracts, What is needed is a sensitive method for examining protein interactions 'in intact 
cells in the relevant cellular compartment. 

Fluorescence imaging has been used to study the intracellular biochemistry of 
living cells. A fluorescent indicator for the adenosine 3 ',5 '-cyclic monophosphate (cAMP) 
signaling pathway has been described in which the sensor is a cAMP kinase in which the 
catalytic and regulatory subunits each are labeled with a different fluorescent dye, such as 
fluorescein or rhodamine, capable of fluorescence resonance energy transfer in the 
holoenzyme complex. A change in shape of the fluorescence emission spectrum occurs 
upon cAMP binding, and therefore activation of the kinase can be visualized in cells 
microinjected with the labeled holoenzyme. Adams et al.. Nature, 349: 694-697 (1991). 
This system is limited by the fact that it requires microinjection, and a preferred distance 
between the labeled units for energy transfer to occur. 

Substrates for P-lactamase have been described in the art which include a 
fluorescent donor moiety and a quencher, which include an attached group which makes 
them permeable through cell membranes, wherein the attached group is hydrolyzed off 
after the substrate enters the cell. Fluorescence energy transfer between the donor and 
quencher is monitored as an indicator of P-lactamase activity. This system also can be 



used in a reporter gene assay using cells containing p-lactamase reporter genes functionally 
linked to a promoter. PCT WO 96/30540 pubUshed October 3, 1996, the disclosure of 
which is incorporated herein. 

DISCLOSURE OF THE INVENTION 

The present invention provides methods and compositions for detecting, assaying 
and quantitating molecular interactions within living cells and in vitro, through 
complementation between two or more low affinity reporter subunits, such as distinct £. 
coli lacZ mutations. In a preferred embodiment, protein-protein interactions within living 
cells are detected and quantitated using the methods and compositions of the present 
invention. The practice of the present invention enables, for the first time, the study of 
protein-protein interactions and their control in living mammalian cells vdthout reliance 
upon the transcriptional activation of a reporter gene construct. Association of the proteins 
of interest results directly in enzyme activity and is independent of other cellular functions. 
Therefore, the present invention provides advantages over other systems currently in use 
by allowing the detection of complexes that are excluded from the nucleus, and detection 
of complexes whose formation would inhibit transcription. Furthermore, the present 
invention allows the detection and localization of specific binding interactions within cells 
at different stages of development and differentiation, and an analysis of the induction or 
inhibition of binding interactions in cells. 

Interactions occurring vsdthin the nucleus of the cell, interactions occurring in the 
cytoplasm, on the cell surface, within or on the surface of organelles, or between 
cytoplasmic and surface (either cellular or organellar) molecules, as well a interactions 
occurring outside the cell, are all capable of being detected in the practice of the present 
invention. Thus, the invention surmounts the limitations associated with previous assays 
for protein-protein interactions, which were either limited to interactions occurring in the 
nucleus, or did not always allow accurate localization of molecular interactions, and which 
were not well-suited for detection of interactions which resulted in inhibition of 
transcription or translation. 

Accordingly, in one embodiment, the invention provides a reporter system 
component comprising: 

a first low-affinity reporter subunit, coupled to a first putative binding moiety; 



wherein the first low-aflanity reporter subunit is capable of association with at least 
a second low-affinity reporter subunit to generate a detectable signal, said association 
being mediated by the first putative binding moiety. 

In another embodiment, the invention provides a method of determining the 
occurrence of binding between first and second putative binding moieties, the method 
comprising: 

a) providing a reporter system comprising: 

a first component comprising a first low affinity reporter subunit, 

coupled to the first putative binding moiety; and 

a second component comprising a second low affinity reporter 

subunit coupled to the second putative binding moiety; 

wherein the first low affinity reporter subunit is capable of association with 

at least the second low affinity reporter subunit to generate a detectable signal, said 

association being mediated by the binding of the first and second putative binding 
moieties; " 

b) combining the first component and the second component; and 

c) detecting the presence or absence of the signal. 

In a further embodiment, the invention provides a method of screening for binding 
of a first binding moiety with members of a plurality of different second putative binding 
moieties, the method comprising: 

a) providing a plurality of reporter systems each comprising: 

a first component comprising a first low affinity reporter subunit coupled to 
the first binding moiety, and 

one of a plurality of second components each comprising a second low 
affinity reporter subunit coupled to one of said plurality of second putative binding 
moieties, wherein in each of said second components, said second putative binding 
moiety is different; 

wherein the first low affinity reporter subunit is capable of association with 
the second low affinity reporter subimit to generate a detectable signal upon the 
binding of the first binding moiety with one of said different second putative 
binding moieties; 



b) individually combining the first component with each of the plurality oj' 
second components to produce a plurality of binding assay samples, each of which 
includes the first component and a different one of the second components; and 

c) detecting the presence or absence of the signal in each of the binding assay 
samples. 

The invention additionally provides nucleic acids encoding fusion proteins 
including a low affinity reporter subunit and a putative binding moiety, and the fusion 
proteins encoded by said nucleic acids. The invention further provides viral vectors 
comprising nucleic acids encoding such fusions proteins. The invention also provides cells 
transformed by the nucleic acids and viral vectors described above. 

All patents, patent applications and publications referred to herein are hereby 
incorporated by reference. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a schematic illustration of three deletion mutant /acZ constructs, 
designated Aa, Acd and A^i. 

Figure 2A is a schematic illustration of a viral construct encoding fusion proteins 
of the Aa or Aco P-gal mutants with either the intracellular FKBP-rapamycin associated 
protein (FRAP) or the intracellular rapamycin binding protein, FK506-binding protein- 12 
(FKBPl 2) upstream of the hygromycin or neomycin resistance genes. 

Figure 2B is a schematic illustration of a viral construct encoding fusion proteins 
of the Aa or Aco p-gal mutants with either FRAP or FKBPl 2 and another protein, 
represented as x and x', upstream of the hygromycin or neomycin resistance genes. 

Figures 3 A and 3B show X-gal staining of fixed cells expressing both FKBP12- 
Ao and FRAP-Aa. Cells shown in 3b were exposed to 10 ng/ml rapamycin for 12 hr. 
Cells shown in 3a were not exposed to rapamycin. 

Figure 4A is a graph of p-gal activity vs. time with and without rapamycin 
treatment of C2C 1 2 cells expressing both FKBPl 2-Aci) and FRAP-Aa fusion proteins. 

Figure 4B is a graph of the dose-response to rapamycin of the activity of p-gal in 
C2C1 2 cells expressing both FKBP12-Aa) and FRAP-Aa fusion proteins. 



Figure 5 shows rapamycin-dependent increase in P-gal activity in lysates from 
cells expressing both FKBP12-A© and FRAP-Aa fusion proteins, measured by 
chemiluminescence. 

Figure 6A shows analysis by Fluorescence- Activated Cell Sorting (FACS) of 
C2C12 cells expressing both FKBP12-Aq) and FRAP-Aa after 90 minutes of rapamycin 
treatment. Dark peaks represent profiles obtained from untreated samples; light peaks 
represent profiles from samples that have been treated with 10 ng/ml rapamycin. 

Figure 6B shows a FACS profile of untreated cells and indicates a subpopulation 
selected on the basis of low p-gal activity. 

Figure 6C shows FACS analysis of the subpopulation of cells selected in Figure 
6B after overnight culture in the absence (dark peak) or presence (light peak) of rapamycin. 
In Figures 6A, 6B and 6C, the vertical axis represents cell number and the horizontal axis 
represents intensity of p-gal fluorescence expressed on a logarithmic scale. 

: Figure 7 shows EGF receptor dimerization monitored using p-gal 
complementation. 

Figure 7A depicts schematically the rationale of the assay: two weakly 
complementing deletion mutants of p-gal are linked to the extracellular and 
transmembrane domains of the EGF receptor. Receptor dimerization, stabilized by EGF, 
will drive p-gal complementation. 

Figure 7B shows the design of the retroviral constructs used in the assay. E. coli 
lacZ deletion mutants Aa and A© were cloned into pWZL vectors expressing neomycin or 
hygromycin resistance, respectively. The extracellular and transmembrane (tm) domains 
of human EGF receptor were cloned in frame with the Aa and Aco mutants. 

Figure 7C shows FACS analysis of a population of transduced and selected cells. 
EGF treatment increases the p-gal activity (fluorescein fluorescence) in a substantial 
proportion of the cells. The FACS profile of cells without EGF treatment is shaded in light 
gray arid is outlined in white. The profile of cells treated with EGF is shaded dark gray. 

Figure 7D shows FACS analysis of chimeric receptor expression, using a 
monoclonal antibody to the extracellular domain of the human EGF receptor. The FACS 
profile of the transduced and selected population is shaded medium gray and outlined in 
white; untransduced cells are shaded light gray and outlined in white. The FACS was used 
to clone cells that had low p-gal activity in the absence of EGF and showed increased p-gal 



activity in the presence of EGF. One clone that had low levels of the chimeric receptor 
relative to the population (shaded in dark gray) was used for further analyses. 

Figure 7E shows induction of EGF receptor dimerization (p-gal activity) in all of 
the cells of the clone selected in Figure 7D, upon treatment with 100 ng/ml EGF for two 
hours. Untreated cells are shaded in light gray and outlined in white; EGF treated cells are 
shaded in d^k gray. 

Figure 7F shows that dimerization can be detected after very short treatments with 
EGF. Cells were treated with 100 ng/ml EGF for 0, 1, 4, 8, and 15 minutes before cells 
were rinsed and processed for FACS analysis. The mean fluorescence of the cell sample is 
plotted. 

Figure 8 shows a time-course of EGF receptor dimerization and receptor 
expression on the cell surface, following treatment with EGF. Cells expressing chimeric 
receptors were treated with 100 ng/ml EGF for 0 to 24 hours. Dimerization, as measured 
by p-gal activity, was monitored by FACS, and the mean P-gal activity (fluorescein 
fluorescence) of the cells was plotted (left-hand axis; — ■— ). Chimeric receptor levels on 
the cell surface were measured on the FACS using a monoclonal antibody to the 
extracellular domain of the human EGF receptor and a phycoerythrin-labeled second 
antibody. Mean phycoerythrin fluorescence values are shown on the right-hand axis (--A- 
-). Triplicate samples were analyzed for each time point, and 5000 cells were analyzed for 
each sample. The error bars indicate the standard deviation of the replicate samples. 

Figure 9 shows that EGF receptor dimerization is enhanced by tyrphostin AG 1478. 

Figure 9A shows, in the left panel, schematic diagrams of different regimens for 
treatment of cells with EGF, tyrphostin, or both. After the various treatments, cells were 
analyzed on the FACS, and the mean fluorescence is shown in the right panel. Each 
treatment was performed in triplicate. 

Figure 9B shows measurements of p-galactosidase activity in EGF^treated cells 
compared with EGF+tyrphostin-treated cells. Cells expressing the chimeric receptor were 
treated with either 100 ng/ml EGF ( ■ ) or EGF and 100 nM tyrphostin AG1478 for 0 
to 24 hours (—A-). Triplicate samples were analyzed for each time point, and the error 
bars indicate the standard deviation of the replicate samples. 



MODES FOR CARRYING OUT THE INVENTION 

Definitions 

As used herein, the following tenns have the following definitions: 

As used herein, a "reporter subxinit" refers to a member of a cx)mplex of two or 
more subunits which are capable of associating with low binding affinity with each other 
to generate a detectable signal, or which are capable of associating with each other and one 
or more additional substances to generate a detectable signal, and which do not 
individually generate the detectable signal. 

As used herein, "low affinity" reporter subunits refer to molecular species which 
have a suflSciently low binding affmity for each other such that when they each are 
covalently attached to two different binding moieties, they substantially do not become 
associated unless a binding interaction between the two binding moieties occurs. "Low 
affinity" thus generally refers to a binding affinity which is at least less than that of the 
attached binding moieties. 

As used herein, "binding moieties" refers to at least two molecular species, such as 
proteins or ft-agments thereof, which interact with each other to form a stable complex. 

As used herein, a "detectable signal" refers to any detectable signal which occurs 
upon the association of the reporter subunits or via the interaction of the associated 
subunits with another substance. The detectable signal may be for example, a 
chromogenic, fluorescent, phosphorescent or chemiluminescent signal, such as a detectable 
product of an enzymatic reaction catalyzed by the associated reporter subunits. 

The terms "protein", "polypeptide", and "peptide" are used interchangeably herein 
to refer to polymers of amino acids of any length. The polymer may be linear or branched, 
it may comprise modified amino acids, and it may be interrupted by non-amino acids. It 
also may be modified naturally or by intervention; for example, disulfide bond formation, 
glycosylation, myristylation, acetylation, alkylation, phosphorylation or 
dephosphorylation. Also included within the definition are polypeptides containing one or 
more analogs of an amino acid (including, for example, unnatural amino acids) as well as 
other modifications known in the art. 

Unless otherwise indicated, the practice of the present invention will employ 
conventional techniques of molecular biology, biochemistry, microbiology, recombinant 
DNA, nucleic acid hybridization, genetics, immimology, embryology and oncology which 



are within the skill of the art. Such techniques are explained fully in the literature. See, for 
example, Maniatis, Fritsch & Sambrook, MOLECULAR CLONING: A LABORATORY 
MANUAL, Cold Spring Harbor Laboratory Press (1982); Sambrook, Fritsch & Maniatis, 
MOLECULAR CLONING: A LABORATORY MANUAL, Second Edition, Cold Spring 
Harbor Laboratory Press (1 989); Ausubel, et al. , CURRENT PROTOCOLS IN 
MOLECULAR BIOLOGY, John Wiley & Sons (1987, 1988, 1989, 1990, 1991, 1992, 
1993. 1994, 1995, 1996). 
Reporter Subunits 

As used herein, a "reporter subunit" refers to a member of a complex of two or 
more subunits which are capable of associating with low binding afFmity with each other 
to generate a detectable signal, or which are capable of associating with each other and one 
or more additional substances to generate a detectable signal, and which do not 
individually generate the detectable signal. 

The detectable signal tiius provides an indication that the subunits have become 
associated. In general, in an assay of the binding affinity ofa first and at least a second - 
molecular species (the "putative binding moiety"), a first component is provided which 
includes one reporter subunit attached to the first molecular species, and a second 
component is provided which includes another ofthe same or different reporter subuiiit 
attached to the second molecular species. The reporter subunits preferably have 
sufficiently low binding affinity for each other such that they substantially do not associate 
with each other in solution unless and until the molecules for which binding affinity is 
being assayed have sufficient binding affinity to mediate complex formation between the 
two components. Upon binding of the binding moieties and resulting association ofthe 
reporter subunits, generally by non-covalent interactions, such as hydrogen bonding or 
hydrophobic interactions, for example, the reporter subunits are oriented close enough to 
each other such that they are capable of associating with low affinity and generating a 
detectable signal. In the system, individual reporter subunits are not able to generate the 
detectable signal. Thus, the reporter subunits undergo forced complementation when 
brought into close proximity. 

The reporter subunits can be designed to have a preferred low affinity for a 
particular application and for the conditions in which the binding assay is done. Binding 
of molecules will depend upon factors in solution such as pH, ionic strength, concentration 



of components of the assay, and temperature. In the binding assays using reporter systems 
described herein, the binding affinity of the reporter subunits should be low enough to 
permit forced complementation. Non-limiting examples of dissociation constants of the 
reporter subunits in an assay solution, such as a buffered system or cell interior, are on the 
order of greater than about 10 ** M for example, greater than 10"* M or optionally, between 
about 10"^ to 10"^ M depending upon the properties of the particular assay system. 

Reporter subunits which have sufficiently low binding afifmity, and yet are still 
capable of associating and generating a detectable signal upon the binding of molecular 
species attached to them can be designed as disclosed herein. Reporter subunits which can 
be used include any low binding affinity subunits which are capable of associating to 
produce a detectable signal. In one preferred embodiment, the reporter subunits are 
proteins which are capable of associating and are capable when associated of catalyzing a 
reaction which produces a directly or indirectly detectable product. 

Protein enzymes capable of catalyzing conversion of a substrate to a detiectable 
reaction product, either directly or indirectly, which have been used, for example, in cell 
based screenmg assays may be used as reporter subunits. The enzymes can be modified 
into reporter subunits and to have a low binding affinity and the ability to undergo forced 
complementation. These may be modified, for example, by site directed or random 
mutagenesis, or deletion mutation, to provide low affinity subunits which are capable of 
associating with low binding affinity and thereby undergo complementation to catalyze an 
enzymatic reaction. For example, reporter subunits capable of complementation with low 
binding affinity may be derived firom enzymes such as P-galactosidase, p-glucuronidase 
(GUS), P-lactamase, alkaline phosphatase, peroxidase, chloramphenicol acetyltransferase 
(GAT) and luciferase. Any of a range of enzymes capable of producing a detectable 
product either directly or indirectly may be so modified or may occur naturally. 
Additionally, reporter subunits may be derived from non-enzymatic molecules. For 
example, association of two proteins may generate a unique conformation in one or both of 
the interacting proteins that can be recognized by an antibody or other ligand. 

P-galactosidase, which is encoded by the £. coli lacZ gene, is an enzyme which has 
been developed in the art as reporter enzyme, P-galactosidase activity may be measured by 
a range of methods including live-cell flow cytometry and histochemical staining with the 
chromogenic substrate 5-bromo-4-chloro-3-indolyl p-D-galactopyranoside (X-Gal). Nolan 



et al, Proc. Natl Acad ScL, USA, 85:2603-2607 (1988); and Lojda, Z., Enzyme 
Histochemistry: A Laboratory Manual, Springer, Berlin, (1979), the disclosures of which 
are incorporated herein. 

Enzyme mutants capable of intracistronic complementation are especially suitable 
as reporter subunits. In E. coll deletions of either the N or C terminus of p-gal produce 
enzyme that is inactive yet can be complemented by coexpression with a second inactive 
deletion mutant containing domams lacking in the furst. The N- and C- terminal domains 
involved in complementation are known as the a and g) regions. Ulbnann et al, J. Mol 
BioU 12:91 8-923 (1965); UUman et al,X Mol Biol, 24:339-343 (1967); and UUman et 
alyJ. Mol Biol, 32:1-13 (1968), the disclosures of which are incorporated herein. p-Gal 
complementation systems in manmialian cells are described in Mohler and Blau, Proc, 
Natl Acad ScL USA, 93:12423-12427 (1996), the disclosure of which is incorporated 
herein. As described therein, vectors expressing complementing mutants of p-gal may be 
constructed. Anatuirally occurring /acZ mutation, AM 15 (Beckwith, J. M?7. 5/d/., 8:427- 
430 (1964); and Prentki, Gene, 122:231-232 (1992) and 7\r^/wre, 369:761-766 (1994), the 
disclosures of which are incorporated herein) designated as Aa herem may be constructed. 
Another deletion mutation, designated Aca herein, was made as disclosed herein, and its 
structure is shown schematically in Figure 1 . The peptide region between the a and o 
regions is referred to herein as the \i region, as first defmed by Mohler and Blau, Proc. 
Natl Acad ScL USA, 93:12423-12427 (1996). The Aa and Aco mutants are demonstrated 
herein to have optimal forced complementation properties. These deletion mutants express 
polypeptides representing an a-acceptor/co-donor (Aa) and an a-donor/co-acceptor (Aco). 

p-Gal complementation is based on the ability of mutant enzyme molecules to 
associate and reconstitute an active enzyme. Accordingly, two p-gal molecules that each 
lack one or more structural domains critical to the activity of the holoenzyme, associate to 
form a single functional unit that contains all of the required structural determinants. This 
phenomenon is dependent on the fact that interactions that would normally take place 
between domains of the single peptide of wild type p-gal, can also exist between domains 
present on two distinct peptides, leading to the formation of a stable dimer. This dimer 
behaves functionally as a single peptide of wild type p-gal, and participates ultimately in 
the formation of the tetramer that represents the active form of the enzyme. Thus, the 
ability of a pair of P-gal mutants to recreate an active form of the enzyme is strongly ■ 



dependent on their ability to form a stable dimer and therefore would be expected to be 
dependent on their affinity for each other. 

Surprisingly, it has been discovered that forced association of complementation of 
two distinct low affinity p-gal mutants results in an efficient formation of active enzyme 
molecules in mammalian cells even though they have relatively low affinity for each other. 
The forced complementation results when the two mutant subunits are brought into 
association due to the binding afiSnity of the binding moieties attached to the mutant 
subunits- By engineering constructs in which domains or proteins of interest drive the 
dimerization between Aa and 

A<D p-gal mutants, it is possible to monitor and quantitate such interactions by assessing 
the efficiency of complementation obtained by coexpression of these fusion proteins in 
intact eucaryotic cells. 

In addition to two-component complementation between Aa and A© p-gal mutants, 
the invention also contemplates three-component complementation among mutants each of 
which contains only a single functional a, ^, or co region. Among other applications, this 
might allow detection of interactions among three distinct protems based on a single 
reporter. Similarly, higher-order systems containing four or more reporter components are 
within the scope of the invention. 

Using the fused protein systems, protein-protein interactions and their regulation 
can be studied in mammalian cells without relying on the transcriptional activation of a 
reporter construct. Association of the proteins of interest directly results in enzyme 
activity and is independent from other cellular functions. Therefore this system allows the 
detection of complexes that are excluded from the nucleus, or that involve partners that 
inhibit transcription. Furthermore it allows the detection, quantitation and detenmination 
of the localization of specific binding interactions within cells, as well as the temporal 
distribution of such binding interactions. Binding interactions may be compared in cells at 
different stages of development or differentiation, as well as in normal vs. pathologic cells 
and in infected vs. uninfected cells, to give but a few examples. Binding interactions can 
therefore be assessed against a background of endogenous competing components that may 
differ in nature and in concentration among different cell types. 

Other enzymes may be identified or constructed which are capable of forced 
complementation in the reporter systems described herein. For example, the phenomenon 



of intracistronic complementation of enzymatic activity has been described for tryptoplian 
synthetase, Jackson et al J, Biol Chem., 244:4539-4546 (1969). Complementation 
between mutant subunits of thymidylate synthase has been described. Pookanjanatavip et 
al, Biochemistry y[;AQ3Q3'\Q3m (1992), the disclosure of which is incorporated herein. 
Thus, reporter subimits derived from any complementing enzyme system known in the art 
can be used in the practice of the present invention. Mutants can be derived from other 
enzymes or proteins that are capable of serving as reporters of protein-protein interactions, 
or whose activity can be regulated as described above. The system exploits the 
complementation ability of low bindmg affinity enzyme mutants for detection of protein- 
protein interactions. 

For example, complementing low affinity reporter subunits derived from p- 
lactamase can be constructed. Activity of the complementing P-lactamase can be detected 
using substrates for Prlactamase developed in the art which include a fluorescent donor 
moiety and a quencher, which include an attached group which makes them permeable 
through cell membranes, wherein the attached group ishydrolyzed off after the substrate 
enters the cell. Fluorescence energy transfer between the donor and quencher then can be 
monitored as an indicator of p-lactamase activity, as described in PCT WO 96/30540 
published October 3, 1996. 

In addition to enzymes which catalyze a reaction to produce a detectable product, 
proteins, protein domains orprotem fragments which are themselves detectable upon 
association can be used. Exemplary proteins include green fluorescent proteins, which 
have characteristic detectable emission spectra, and have been modified to alter their 
emission spectra, as described in PCT WO 96/23810, the disclosure of which is 
incorporated herein. Fusions of green fluorescent proteins with other proteins, and DNA 
sequences encoding the fusion proteins which are expressed in cells are described in PCT 
WO 95/07463, the disclosure of which is incorporated herein. 

Other exemplary subunits include subunits which are capable of associating to 
produce a photochemical signal such as a fluorescent or luminescent signal, including 
chemiluminescent or photoluminescent signals. The reporter subunits also may comprise 
fluorophores which are capable of detectable resonance energy transfer when they are 
closely associated, as disclosed, for example, in EP Publication No. 0 601 889 A2 and 
PCT WO 96/41 1 66, the disclosures of which are incorporated herein. 



Other complementing en2ymes are known in the art, for example, pancreatic 
ribonuclease and Staphylococcal nuclease. Mutants of the complementing subunits of 
these enzymes can be constructed, by methods well-known to those of skill in the art such 
as site-directed mutagenesis, to generate low-afiBnity complementing subunits. One 
possible use for these types of complementing protein is as a tumor therapeutic, wherein a 
tumor-specific protein serves as a bridge to bring together two protems, each of which is 
fused to a low-afFmity complementing fi-agment of the nuclease. The resultant nuclease 
activity might, in some cases, kill the cell by destroying mRNA, genomic DNA, etc. 

Binding Moieties 

Binding moieties which can be assayed for their binding affinity with each other 
include any molecules capable of a binding interaction. The binding interaction between 
the two or more binding moieties may be either direct or in the form of a complex with one 
or more additional binding species, such as charged ions or molecules, ligands or 
macromolecules. 

The binding moieties which are attached to the reporter subunit can be any of a 
range of different molecules including carbohydrates, lipids, proteins, and nucleic acids, as 
well as portions, polymers and analogues thereof, provided they are capable of being 
linked to the reporter subunit. Exemplary proteins include members of a signal 
transduction cascade, proteins regulating apoptosis, proteins that regulate progression of 
the cell-cycle or development of tumors, transcriptional regulatory proteins, translational 
regulatory proteins, proteins that affect cell interactions, cell adhesion molecules (CAMs), 
ligand-receptor pairs, proteins that participate in the folding of other proteins, and proteins 
involved in targeting to particular intracellular compartments, such as the Golgi apparatus, 
endoplasmic reticulum, ribosomes, chloroplasts and mitochondria. 

Other exemplary proteins include protein hormones and cytokines. Cytokines 
include those involved in signal transduction, such as interferons, chemokines, and 
hematopoietic growth factors. Other exemplary proteins include interleukins, 
lymphotoxin, transforming growth factors-a and p, and macrophage and granulocyte 
colony stimulating factors. Other proteins include intracellular enzymes such as protein 
kinases, phosphatases and synthases. 

Exemplary proteins involved in apoptosis include tumor necrosis factor (TNF), Fas 
Ugand, interleukin-lp converting enzyme (ICE) proteases, and TNF-related apoptosis- 



inducing ligand (TRAIL). Proteins involved in the cell cycle include deoxyribonuclei'fc 
acid (DNA) polymerases, proliferating cell nuclear antigen, telomerase, cyclins, cyclin 
dependent kinases, tumor suppressors and phosphatases. Proteins involved in transcription 
and translation include ribonucleic acid (RNA) polymerases, transcription factors, 
enhancer-binding proteins and ribosomal proteins. Proteins involved in cellular 
interactions such as cell-to-cell signaling include receptor proteins, and peptide hormones 
or their enhancing or inhibitory mimics. 

Binding of molecules will depend upon factors in solution such as pH, ionic 
strength, concentration of components of the assay, and temperature. In the binding assays 
using reporter systems described herein, the binding affinity of the binding moieties should 
be high enough to permit forced complementation between the reporter subunits. Non- 
limiting examples of dissociation constants of the binding moieties in an assay solution, 
such as a buffered system or cell interior, are on the order of less than about lO 'M, for 
example, less than about 

10 ' M, or optionally, between about IQ-" to 10'"M, depending upon the properties of the 
particular assay system. 

Linking of the Reporter Subimit and the Binding Moiety 

The reporter subunit and one or more binding moieties are generally linked either 
directly or via a linker, and are generally linked by a covalent linkage. For example, when 
the reporter subunit and the binding moiety are proteins, they may be linked by methods 
known in the art for linking peptides. 

In one preferred embodiment, the reporter subunit and the binding moiety comprise 
a fusion protein including the reporter subunit which is a low binding affinity enzyme 
complement and the binding moiety being assayed. The fusion protein can thus be 
expressed from an encoding nucleic acid intracellularly. This system is advantageous 
since it permits the detection and quantitation of protein-protein interactions in cells, such 
as mammalian cells, based on enzymatic complementation of the low affinity reporter 
subunits. 

For example, in the embodiment wherein chimeric fused proteins are produced 
intracellularly that include one of two complementing low affinity p-gal mutants and a 
"test" protein of interest, the detected p-gal activity due to interactions between two 
chimeric proteins of interest will be proportional to the strength of the interaction of the 



non-p-gal protein components. Thus, the interaction is driven by the test proteins of 
interest, not the complementing mutants. The enzymatic activity serves as an indicator of 
that interaction. Another advantage of this system is that only low levels of expression of 
the test proteins are required to detect binding. 

The fusion gene constructs preferably are constructed and transformed into cells to 
produce low level expression. The system then permits the monitoring of interactions in a 
given cell in the presence of endogenous competing protein partners, where the fusion 
protein will function as a "tracer" for the binding/association reaction. Such a system will 
not be prone to artifacts arising from overexpression of introduced proteins. Reduction in 
expression of fusion gene constructs can be accomplished by choice of appropriate 
promoters, ribosome binding sites and other regulatory elements. For example, fusion 
gene constructs can be introduced into vectors in which they lie upstream of an antibiotic 
resistance gene whose translation is regulated by the Encephalomyocarditis virus internal 
ribosome entry sequence (IRES), and which contain a mutation in the splice 
donor/acceptor"sequences upstream of the ATG sequence responsible for translational 
initiation of the fusion giene. This type of construct results in a lower translation efficiency 
of the first coding sequence in a bicistronic message, but does not affect translation of the 
second (antibiotic resistance) sequence, which is solely dependent on the IRES. As a 
result of these reduced levels of expression, the frequency of spontaneous interaction of 
reporter subunits, which is concentration-dependent, will be significantly reduced. 

Expression of Fusion Proteins 

The invention provides fusion proteins between a putative binding moiety and a 
low affinity reporter subxmit. The putative binding moiety may comprise any protein or 
other molecule whose ability to bind to a second molecule is to be tested. The low affinity 
reporter subunit may be any molecule wherein the monomer subunit is inactive, but 
association of two or more identical or different monomers restores activity. The activity 
must be capable of generating a detectable signal. In a preferred embodiment, the low 
affinity reporter subunits comprise mutants of p-galactosidase capable of complementation 
with one another. 

Fusion proteins comprise a single continuous linear polymer of amino acids which 
comprise the full or partial sequence of two or more distinct proteins. The construction of 
fusion proteins is well-known in the art. Two or more amino acids sequences may be 



joined chemically, for instance, through the intennediacy of a crosslinking agent. In a' 
preferred embodiment, a fusion protein is generated by expression of a fusion gene 
construct in a cell. A fusion gene construct comprises a single continuous linear polymer 
of nucleotides which encodes the full or partial sequences of two or more distinct proteins 
in the same uninterrupted reading frame. Fusion gene constructs generally also contain 
replication origins active in eucaryotic and/or procaryotic cells and one or more selectable 
markers encoding, for example, drug resistance. They may also contain viral packaging 
signals as well as transcriptional and/or translational regulatory sequences and RNA 
processing signals. 

The fusion gene constructs of the invention are introduced into cells to assay for 
binding between the putative binding moieties encoded by the fusion gene constructs. The 
fusion gene constmcts may also contain promoters and other transcriptional and/or 
translational regulatory sequences that are normally associated with the gene encoding the 
putative binding moiety. The fusion gene constructs may be mtroduced into cells by any 
method of nucleic acid transfer known in the art, including, but not limited to, viral 
vectors, transformation, co-precipitation, electroporation, neutral or cationic liposome- 
mediated transfer, microinjection or gene gun. Viral vectors include retroviruses, 
poxviruses, herpesviruses, adenoviruses, and adeno-associated viruses. Particularly 
preferred in the present invention are retroviral vectors, which are capable of stable 
integration into the genome of the host cell. For example, retroviral constructs encoding 
integration and packaging signals, drug resistance markers and one or more fusion genes of 
interest are useful in the practice of the invention. 

Different fusion gene constmcts encoding unique fusion proteins may be present on 
separate nucleic acid molecules or on the same nucleic acid molecule. Inclusion of 
different fusion gene constructs on the same nucleic acid molecule is advantageous, in that 
uptake of only a single species of nucleic acid by a cell is sufficient to introduce sequences 
encoding both putative binding partners into the cell. By contrast, when different fusion 
constmcts are present on different nucleic acid molecules, both nucleic acid molecules 
must be taken up by a particular cell for the assay to be functional. Thus, problems of cell 
mosaicism are avoided when both fusion gene constmcts are included on the same nucleic 
acid molecule. 



The fusion gene constructs or fusion proteins of the invention may be introduced 
into cultured cells, animal cells in vivo, animal cells ex vivo, or any other type of cell in 
which it is desired to study protein-protein interactions. 

Assays 

The reporter systems disclosed herein may be used to assay binding interactions of 
putative binding moieties attached to low affinity reporter subunits through 
complementation between the low affmity reporter subunits which produces a detectable 
signal. In addition to testing for direct binding interactions between the putative binding 
moieties, interactions dependent upon one or more additional molecules or ions may be 
evaluated. Further, multi-molecular interactions in living animal cells can be evaluated, as 
well as the influence of various drags, peptides and pharmaceuticals on these interactions. 

In one embodiment, the binding affinity of one or more putative binding moieties 
may be measured by providing a reporter system including one component having one of 
the moieties bound to a low affinity reporter subxmit and at least one other component 
including one other putative binding moiety bound to a second low affinity reporter 
subunit. The binding moieties may be different or the same. In the system, the reporter 
subunits are capable of binding and generating a detectable signal only if they are brought 
into proximity by the binding of the one or more putative binding moieties. The signal can 
be directly or indirectly detected and quantitated. 

In one embodiment of the invention, protein-protein interactions can be detected 
and quantitated. The signal produced by the complementing reporter subimits can serve as 
an indicator of binding between the putative binding moieties, either directly or indirectly 
via a third substance. Signals which could be detected include light emission and 
absorbance. Exemplary signals include chromogenic, fluorescent and Iximinescent signals. 
These signals can be detected and quantitated visually or through the use of 
spectrophotometers, fluorimeters, microscopes, scintillation counters or other 
instrumentation known in the art. 

Bindmg of components of the reporter systems disclosed herein will depend upon 
factors in solution, such as pH, ionic strength, concentration of components of the assay, 
and temperature. Assay solutions can be designed and developed for a particular system. 
The reporter systems disclosed herein can be used to conduct assays in solutions, such as 
buffered cell free solutions, cell interiors, solutions of cells, solutions of cell lysates, and 



solutions of ceil fractions, such as nuclear fractions, cytoplasmic fractions, mitochondrial 
fractions, and membrane fractions. Methods for preparing assay solutions, such as enzyme 
assay solutions, cell extracts, and cell suspensions, known in the art may be used. For 
example, physiologically compatible buffers such as phosphate buffered saline may be 
used. See for example, the series, Methods in Enzymology, Academic Press, New York. 

In one embodiment, the low affinity reporter subunits are capable of 
complementing one another to form an enzymatically active complex that is capable of 
catalyzing the conversion of a substrate to a product which is detectable, either directly or 
indirectly. In one embodiment, the reporter system can include two or more components, 
each of which is a fiision protein, wherein the fiision proteins each comprise a putative 
binding protein fiised to a low affinity reporter subunit. Thus, nucleic acids encoding the 
fiision proteins can be constructed, introduced into cells and expressed in cells. 
Altematively, the bound reporter units or bound binding moieties can be detecting by 
detecting the binding of a labeled specific bindmg moiety such as an antibody to the bound 
complex. 

In one embodiment, the low affinity reporter subunits may be complementing 
subunits of p-gal. The system may include three or more reporter subunits all of which are 
required to associate in order to produce the detectable signal. Methods for detecting the 
reaction products of active P-gal that have been developed in the art may be used. For 
example, p-galactosidase activity may be measured by a range of methods including live- 
cell flow cytometry and histochemical staining with the chromogenic substrate 5-bromo-4- 
chloro-3-indolyl P-D-galactopyranoside (X-Gal). Nolan et al, Proc. Natl Acad Sci, USA, 
85:2603-2607 (1 988); and Lojda, Z., Enzyme Histochemistry: A Laboratory Manual^ 
Springer, Berlin, (1979), the disclosures of which are incorporated herein. Histochemical 
staining for P-gal can be achieved by fixation of cells followed by exposure to X-gal. 

Assays for P-gal activity described in Mohler and Blau, Proc. Natl Acad Set, 
93:12423-12427 (1996), the disclosure of which is hereby incorporated by reference, may 
be used. In one embodiment, intracellular analyses may be conducted by fixing cells and 
staining with the indigogenic substrate X-gal. Fixed cells also can be analyzed by assaying 
for p-gal activity by fluorescence histochemistry using an azo dye in combination with 
either X-gal or 5-bromo-6-chloro-3"indolyl P-D-galactopyranoside (5-6-X-Gal). A 
preferred combination is the azo dye red violet LB (Sigma Chemical, St. Louis, MO) and 



5-6-X-Gal, referred to as Fluor-X-gal. For this combination, fluorescence micrographs can 
be obtained on a fluorescence microscope using a rhodamine/Texas Red filter set. Use of 
these substrates allows, for the first time, P-gal-dependent fluorescence to be visualized 
simultaneously with two or more other fluorescent signals. 

Vital substrates for p-gal, which can be used in living cells, are also encompassed 
by the invention. For example, a vital fluorogenic substrate, resorufin p-galactoside 
bis-aminopropyl polyethylene glycol 1900 (RGPEG) has been described. Minden (1996) 
BioTechniques 20(1}: 122- 129. This compound can be delivered to cells by microinjection, 
electroporation or a variety of bulk-loading techniques. Once inside a cell, the substrate is 
unable to escape through the plasma membrane or by gap junctions. Another vital 
substrate that can be used in the practice of the invention is fluorescein di-p-D- 
galactopyranoside (FDG), which is especially well-suited for analysis by fluorescence- 
activated cell sorting (FACS) and flow cytometry. Nolan et al (1988) Proc, Natl Acad. 
Scl USA 85:2603-2607 andRotman etaL (1963) Proc. Natl Acad Set USA 50:1-6. 

p-gal may also be detected using a chemiluminescehce assay. For example, cells 
containing p-gal fusions are lysed in a mixture of buffers containing Galacton Plus 
substrate from a Galactolight Plus assay kit (Tropix, Bedford MA). Bronstein et al , I 
Biolumin. Chemilumm., 4:99-1 1 1 (1989) the disclosure of which is incorporated herein, 
After addition of Light Emission Accelerator solution, luminescence is measured in a 
luminometer or a scintillation counter. 

Reporter systems other than p-gal may also be used in the practice of the invention. 
For example, the enzyme P-glucuronidase (GUS) can be used as a reporter and 
chromogenic and fluorogenic GUS substrates have been developed. The GUS substrate 5- 
bromo-4-chloro-3-indolyl p-D-glucuronic acid (X-gluc) can be used in both chromogenic 
and fluorogenic applications, as follows. In one method of chromogenic staining, fixed 
cells are washed in PBS and stained with 2 mM X-gluc (Molecular Probes, Eugene OR), 
10 mM EDTA, 0.5 mM K3Fe(CN)6, 0.5 mM K4Fe(CN)6, 0. 1% Triton X-100, 0.1 M 
NaP04. Fluorogenic staining may be achieved by using a combination of 5-bromo-6- 
chloro-3-indoIyl p-D-glucuronic acid (5, 6 X-gluc, Molecular Probes, Eugene, OR) and 
Fast Red Violet LB (Sigma Chemical, St. Louis, MO), Fixed cells are rinsed with PBS 
and stained in 50 ^g/ml 5, 6 X-gluc and 100 ^ig/ml Fast Red Violet LB, then rinsed in PBS. 



Fluorescence is detected on a fluorescence microscope adjusted for detection of rhodkmine 
fluorescence. 

In one embodiment of the invention, the reporter subunits comprise an en2yme and 
an inhibitor of the en2yme. Preferably, the inhibitor has a low affinity for the enzyme. In 
this case, association between the putative binding moieties is evidenced by inhibition of 
the activity of the enzyme. Exemplary enzymes include p-gal, GUS, P-lactamase, etc. 

While dimeric reporter subunit complexes are discussed herein by wray of example, 
multimeric reporter subunits also can be used, as can reporter subunits which are only 
active in the presence of one or more additional molecules or atoms. An example of a 
trimeric reporter subunit system would be one consisting of a p-gal co donor (such as a 
Aa-Aji double mutant), a p-gal ^ donor (such as a Aa-Ao) double mutant) and a p-gal a 
donor (such as a Ajli-Aco double mutant), wherein each individual mutant, and any pairwise 
combination of two mutants, is enzymatically inactive. Activity would be obtained only if 
all three subunits were able to associate vwth one another. Enzyme reaction products can 
be detected using methods available in the art, such as biochemical assay, microscopy, 
flow cytometry, light emission or absorption detection, and immunological methods. 

The methods disclosed herein enable the detection and quantitation of binding 
events in cell lysates, as well as in intact cells. Thus, interactions between fully folded 
proteins are detectable, and co-translational expression of the binding moieties is not 
necessary for binding to be detected. 

In the practice of the invention, the reaction product may be detected indirectly, for 
example, through immunological techniques, such as immunofluorescent labeling. 

Protein-protein interactions can be measured in a reporter system which includes 
one or more fusion proteins. The fusion proteins each include a putative binding protein 
coupled to a low affinity reporter subunit For intracellular expression of the fusion 
proteins, one or more fusion gene constructs are prepared which include sequences 
encoding the fusion protein(s). The fusion gene constructs may be introduced into cells by 
methods available in the art, including, but not limited to, viral vectors, transformation, co- 
precipitation, electroporation, neutral or cationic lipospme-mediated transfer, 
microinjection or gene gun. 

A variety of cell-based assays can be conducted using the cells containing the 
fusion gene constructs. Binding of the putative binding moieties on the fusion proteins 



expressed in the cells can be confirmed by detecting the signal produced by the reporter 
subunits undergoing forced complementation. Thus, for example, when the reporter 
subunits are complementing p-gal subunits, cells exhibiting p-gal activity indicate binding 
between the putative binding moieties within those cells. 

The fusion gene constructs may also contain promoters and other transcriptional 
and/or translational regulatory sequences that are normally associated with the gene 
encoding the putative binding moiety. This permits the study of physiologically-relevant 
levels of the putative binding proteins in vivo, in contrast to systems in which test proteins 
are overexpressed. Further, this permits the study of naturally-occurring changes in levels 
of binding activity over time and can reveal the effects of endogenous or exogenous 
substances on binding interactions. 

The methods and compositions of the invention can also be used to study other 
molecules which influence the interaction of two putative binding partners. Proteins, 
peptides, nucleic acids, carbohydrates, lipids, ions, small molecules, synthetic compounds 
or other substances (either endogenous to the cell or exogenously added) may act as either 
agonists or antagonists of a binding interaction. By measuring the effect of such molecules 
on, for example, p-gal activity produced by cells containing two or more fusions 
representing a particular pair of test proteins, agonist or antagonist activity of such 
molecules can be determined. Use of the methods and compositions of the invention will 
allow high-throughput assays to be carried out to test for agonists or antagonists of a 
particular binding interaction. Such high-throughput assays will be especially valuable in 
screening for drugs that influence medically-relevant protein-protein interactions. 

Putative binding partners, or putative binding moieties, as used in the invention, 
can include molecules which do not normally interact with each other, but which each 
interact with a third molecule so that, in the presence of the third molecule, the putative 
binding partners are brought together. Thus, substances which influence an interaction 
between putative binding partners include those which stimulate a weak interaction 
between putative bindmg partners, as well as one or more molecules which mediate 
interaction between molecules which do not normally interact wdth each other. In addition, 
substances which influence an interaction between putative binding partners can include 
those which directly or indirectly affect an upstream event which results in association 
between the putative binding partners. For example, if phosphorylation of one of the 



putative binding partners endows it with the capacity to associate with another of the' ' 
putative binding partners; substances which influence the interaction of the putative 
binding partners include those which directly or indirectly affect a kinase activity. 

Assays can be developed as disclosed herein to examine the effect on 
intermolecular interactions of a variety of compositions including drugs such as antipyretic 
and anti-inflammatory drugs, analgesics, antiarthritics, antispasmodics, antidepressants, 
antipsychotics, tranquilizers, antianxiety drugs, narcotic antagonists, antiparkinsonism 
agents, cholinergic antagonists, chemotherapeutic agents, immunosuppressive agents, 
antiviral agents, parasiticides, appetite suppressants, antiemetics, antihistamines, 
antimigraine agents, coronary vasodilators, cerebral vasodilators, peripheral vasodilators, 
hormonal agents, contraceptives, antithrombotic agents, diuretics, antihypertensive agents, 
cardiovascular drugs, opioids, and vitamins. 

Protein-protein interactions mediated by a third molecule can be detected and 
quantitated. The kinetics of binding also can be studied. An example of such a system is 
described in Exaniples 1 and 2 below, wherein P-gal fusion proteins are used to monitor 
the rapamycm-mediated interaction of the FKBP 12 and FRAP proteins. Belshaw, P. J. et 
al,Proc. Natl. Acad 5c/. m4, 93: 4604-4607 (1996); Brown et al., A/a/wr^ 369: 756-758 
(1994); Chen, et al^ProcNatl Acad ScL, USA, 92:4947-4951 (1995); and Choi, J. et al. 
Science, 273:239-242 (1996). For example, kinetics of binding can be detemiined by 
measuring p-gal activity at different times following addition of rapamycin to cultures of 
cells expressing fusions of FKBP12 and FRAP to two complementing, low affinity Prgal 
mutants (e.^., Aa and Aco). A dose-response curve can also be obtained, in which the 
extent of binding, as measured by P-gal activity, is determined as a function of rapamycin 
concentration. 

This assay can be adapted to control for the possible effect of a protein component 
on its fusion partner, thereby enabling the study of protein-protein interactions in a 
quantitative fashion. In one such control system, tripartite fusion constructs including a 
reporter subunit, a binding protein and the protein of interest are provided. As described 
. below in Example 3, in one embodiment, the fusion protein includes 1) a P-gal mutant 
portion, 2) a FKBP 12 or FRAP portion, and 3) a test protein portion. The most N-terminal 
component is the test protein, followed by FKBP12-Aa> or FRAP-Aa. The presence of 
FKBP 12 and FRAP in these constructs allows rapamycin-mediated dimerization of the 



fusion proteins. The absolute values of p-gal activity obtained by simple co-expression of 
a fusion containing a test protein of interest and fusions containing different potential 
interacting partners is determined. In parallel samples, p-gal activity is measured upon 
induction of complementation with a fixed amount of rapamycin. The ratio of p-gal 
activity obtained in the absence and the presence of rapamycin indicates the relative 
abilities of the different protein pairs to interact with each other. 

A further advantage of the tripartite fusion system is that the presence of the 
FKBP12 and FRAP components provides a flexible hinge domain between the p-gal 
mutants and the test protein. This reduces the possibility of interference between the p-gal 
component and the test protein. Furthermore, it allows direct testing of the functional 
integrity of the p-gal components in the fusions without the need for reclonmg into more 
efficient viral vectors. For example, the tetracycline repressor, tetR, forms homodimers in 
mammalian cells with high efficiency. Hinrichs et al (1994) Science 264:41 8-420. 
Coexpression of ^e/^-FKBP12-Aa) and Ye/7J-FRAP-Aa fusions yielded P-gal-positive cells, 
as shown in Example 3, showing that it is jpossible to construct functional tripartite fusions, 
in which dimerization of the N-terminal peptide component efficiently drives 
complementation of the C-terminal mutant p-gal polypeptides, with the FKBP12 and 
FRAP components serving as both intemal standards and flexible hinges. 

The system may be further tested and compared by constructing fusions between 
each p-gal mutant and the complete coding sequence of MEF2c. Since MEF2c is known 
to form homodimers in vivo, coexpression of both p-gal mutants fused to MEF2c should 
result in readily detectable enzymatic activity. MEF2c mutants that are impaired in their 
dimerization potential arc available and fusion of one of the mutants to one of the p-gal 
mutants can serve as a negative control to further validate the system. Molkentin, et al,, 
Mol Cell BioL, 16:2627-2636 (1996). 

The reporter system can also be designed with controls to permit the quantitation of 
the expression level of the P-gal fusion proteins. This will make it possible to control for 
potential differential expression of the two (or more) fusion proteins. For example, a 
peptide tag for which well-characterized monoclonal antibodies are available may be fused 
in frame at the C-terminus of each p-gal mutant. Different tags, such as flag and myc may 
be used for Aa and A©, to allow differential detection of the two mutants even when 
coexpressed in the same cells. In parallel with the determination of P-gal activity in the 



lysates of these cells, an ELISA assay can determine the precise amount of each P-gal/ 
fusion protein in the same lysates. First, a polyclonal anti-P-gal antiserum may be used to 
immobilize the antigens. Then the monoclonal antibody directed against the appropriate 
tag followed by an enzyme-lmked anti-mouse secondary antibody may be used to quantify 
the amount of the P-gal fusion protein of interest Such an approach, employing well- 
characterized techniques, should allow a determination of the expression levels of each 
fusion protein. This modification will be useful where the attached tag does not impair the 
binding of the protein or the ability of the reporter subunits to complement. 
Applications of the Invention 

As will be apparent to one of skill in the art, the invention allows, for the first time, 
a broad range of studies of multiprotein and other types of multi-molecular interaction to 
be carried out quantitatively or qualitatively in living cells. In what follows, non-limiting 
examples of different applications of the invention are provided. 

The observation that levels of p-gal activity in the presence and absence of forced 
complementation can be distinguished, both biocheniically (Figure 5) and by FACS 
(Example 10 and Figure 6), suggests that the methods of the invention can be used to 
screen for new binding partner(s) for a given target protein. In this embodiment, the target 
protein, fused to a weakly-complementing p-gal mutant is stably expressed in a well- 
characterized cell line. Expression libraries containing cDNAs fused to a weakly- 
complementing P-gal mutant are introduced into these cells using, for example, retroviral 
vectors {e.g., Kitamura e/ al, Proc Natl. Acad. Sci. USA ^:9146-9150 (1995) ) or any 
other means of gene transfer known in the art. Vectors expressing gene products that 
interact with the target protein are isolated by identifying p-gal-positive clones. An 
advantage of this system is that the screen can be carried out in any cell type, regardless of 
the cell's milieu of endogenous (and potentially competing) proteins. A further possibility 
for this type of system is that the target protein can be localized to a specific cellular 
compartment, with the aim of identifying proteins involved in interactions restricted to that 
particular location. 

The use of fluorescence-activated cell sorting techniques is particularly well-suited 
to this embodiment of the invention. For example, p-gal-positive cells which contam 
cDNAs expressing gene products that interact with the target protein will generate a signal 
that will allow such cells to be purified by cell-sorting techniques. Such cDNAs could be 



delivered, for example, using retroviral vectors that allow introduction of high complexity 
cDNA libraries with high infection efficiency. 

The assays and methods of the invention can also be carried out in the presence of 
extracellular signaling molecules, growth factors or differentiation factors, peptides, drugs 
or synthetic analogs, or the like, whose presence or effects might alter the potential for 
interaction between two or more given proteins in a particular cell type. 

Detection of molecular interactions, using the methods and compositions of the 
invention, is not limited to those occurring in the nucleus, nor is it limited to intracellular 
interactions. For instance, interactions involving surface receptors can be detected in the 
practice of the invention. In one embodiment, the invention provides new techniques for 
detecting ligand-induced dimerization of surface receptors in living cells. Dimerization, or 
higher order oligomerization, of cell surface receptors is often a prerequisite for receptor 
activation and ensuing signal transduction. For example, the binding of epidermal growth 
factor (EGF) to its receptor stabilizes the dimerization of the receptor and leads to 
activation of its tyrosine kinase activity. Schlessinger et al (1992) Neuron 9:383-391; 
Ulhich et al (1990) Cell 61:203-212; and Weiss et al (1997) Curr, Opin, Genet Dev, 
7:80-86. Example 11, infra, discloses the use of p-gal complementation to monitor 
membrane receptor dimerization in living cells. For this purpose, the weakly 
complementing Aa and A© deletion mutants of P-gal were fused to the extracellular and 
transmembrane regions of the human EGF receptor to form a chimeric receptor molecule 
(see Figure 7A). Deletion of the cytoplasmic domain of the receptor prevents the 
internalization and degradation of the receptor that is normally observed following EGF 
stimulation of cells (Livneh et al (1986) 1 Biol Chem, 261:12490-12497), permitting an 
analysis of receptor dimerization over time in changing conditions. The results presented 
in Example 1 1 demonstrate that this embodiment of the invention can be used to detect a 
previously-unrecognized mode of regulation of EGF receptor signaling, in which EGF 
receptor tyrosine kinase activity acts as a feedback inhibitor limiting the dimerization of 
the receptor. 

The practice of the invention is not limited to detection of interaction between two 
different molecules. Multimerization of a molecule can also be detected using the methods 
and compositions of the invention. In this regard, Example 11 discloses the detection of 
receptor dimerization through the practice of the invention. 



By combining the methods and compositions of the invention with state-of-th'p-art 
methods for construction of high-titer, high-complexity cDNA libraries in retroviruses 
(e.g., Pear et al, (1 993) Proc. Natl. Acad. Sci. USA 90:8392-8396), it will be possible to 
identify interaction partners of a specific test protein in mammalian cells {i.e., perform 
functional genomics at the protein level). For this application, construction of cDNA 
libraries in retroviral vectors wherein the cDNA coding sequence is fused to a sequence 
encoding a low affinity reporter subunit will be used. A sequence encoding a binding 
protein of interest will be fused to a low afFmity reporter subunit in a first retroviral vector. 
In a second series of retroviral vectors, a second complementing low affinity reporter 
subunit will be fiised to a variety of different proteins that will be tested for their ability to 
bind to the protein of interest. Testing will be conducted by co-infection of cells with the 
first and one of the series of second retroviral vectors. Those test proteins which are 
capable of binding to the protein of interest will allow detection of a reporter signal in cells 
in which they are co-expressed with thp protein of interest. Thi^ application will also be 
useful in screening for agonists and antagomsts of medically-relevant protein interactions. 

In one embodiment of the invention, cells in which a protein encoded by one of the 
series of second vectors is able to mteract with the binding protein of interest encoded by 
the first vector are detected and isolated by flow cytometry or fluorescence-activated cell 
sorting (FACS). Methods for flow cytometry and FAGS are well-known in the art; e..g., 
Nolan et al. (1988) Proc. Natl Acad Sci. USA 85:2603-2607; Webster et al., Exp. Cell 
Research. 174:252-265 (1988); and Parks et al (1986) in The Handbook of Experimental 
Immunology, (eds. Weir, D.M., Herzenberg, L.A., Blackwell, C.C. & Herzenberg, L.A.), 
Blackwell, Edinburgh, 4th edition, pp. 29.1-29.21. In this way, clones of cells in which 
binding occurs can be isolated and propagated for fiirther study. This aspect is particularly 
suited for studies of developmental mechanisms, wherein it is possible to select a 
population of cells in which a particular developmentally-relevant interaction has occurred 
and study the fiirther development of that cell population, while at the same time, studying 
the fiirther development of cells in which the interaction has not occurred, for comparison. 
In a similar fashion, the practice of the invention makes it possible to isolate and/or study 
the fiirther development of cells exhibiting interactions involving protein such as 
transcriptional regulatory proteins, translational regulatory proteins, DNA replication 
proteins, mRNA splicing proteins, proteins involved in signal transduction, proteins 



involved in cell-cell and cell-substrate adhesion (for example, cell movement, axon 
guidance and angiogenesis), oncogene products, tumor suppressors, proteins involved in 
cell-cycle control and viral proteins, such as those involved in regulation of viral 
replication, virus-host interactions and virus assembly, and proteins w^hich are subunits, 
crosslinkers, modifying agents or molecular motors within the cytoskeleton of cells. 

For a given target protein whose gene is capable of being fused to a low-affinity 
complementing reporter subunit, it is possible to identify known and heretofore xmknown 
proteins or other endogenous or extraneous substances with which it interacts, by using the 
compositions and methods of the invention. In like manner, for a sequence which encodes 
a protein of unknown function, such as may be obtained from a nucleic acid sequence 
database, (or a plurality of sequences such as a cDNA library) the practice of the invention 
allows one to identify molecules with which the encoded protein interacts. The identity of 
the interacting molecule(s) is likely to provide information with respect to the structure 
and/or function of the unknown protein. Thus, the practice of the invention will likely aid 
in the identification and characterization of newly-discovered proteins and protein-coding 
nucleic acid sequences. 

In another aspect of the invention, a shotgun approach to the identification of 
protein-protein interactions can be taken by generating a first set of constructs which will 
express the encoded products of one cDNA library fused to a first low-affinity 
complementing subxmit and a second set of constructs which will express the encoded 
products of a second (or the same) cDNA library, fused to a second low-affinity 
complementing subunit. Co-expression of the two sets of constructs and selection of cells 
in which complementation occurs will allow the isolation of clones and the identification 
of cDNAs which encode interacting partners. One or both of the interacting partners may 
be known; alternatively, both of the interacting partners may represent heretofore 
unidentified proteins. If both partners are known, new information about their binding 
specificity may be obtained. If one partner is known, it may provide information on the 
function of the unknown binding partner. If neither are known, the observation that they 
interact may assist in the eventual identification of one or both of the interacting pair. 

The invention may be applied to studies of the mechanisms that regulate either 
homo- or hetero-dimerization or multimerization of specific molecules, including high 



efficiency screening to identify syntlietic or naturally occurring compounds capable of 
influencing such dimerization. 

The invention can be used for investigations relating to the localization of specific 
complexes within mtact cells, or intact animals. Types of cells which can be used are 
primary or established cell lines and other types of embryonic, neonatal or adult cells, or 
transformed cells (for example, spontaneously- or virally-transformed). These include, but 
are not limited to fibroblasts, macrophages, myoblasts, osteoclasts, osteoclasts, 
hematopoietic cells, neurons, glial cells, primary B- and T-cells, B- and T-cell lines, 
chondrocytes, keratinocytes, adipocytes and hepatocytes. 

It is also possible, through practice of the invention, to devise systems for 
regulation of enzyme activity by regulating the association of complementing mutants. 
This aspect of the invention has potential applications to human therapy, as a method to 
regulate the enzyme-driven conversion of pro-drugs into their active forms. 

Processes involving molecular interactions, particularly protein-protein 
interactions, which can be studied in the practice of the invention include, but are not 
limited to, transcription, translation, replication, mitosis, grovrth control, progression and 
regulation of the cell-cycle, apoptosis, cell-cell, cell-substratum and cell-ligand 
interactions, intracellular signal transduction cascades, oncogenesis, cell lineages, and 
embryonic development. Examples of cell ligands include leptin and growth factors such 
as epidermal growth factor (EGF), nerve growth factor (NGF), platelet-derived growth 
factor (PDGF), and insulin-like growth factors I and II (IGF-I and IGF-II), transforming 
growth factors a and p (TGF-a and TGF-p), endorphins and endorphin receptors, 
prostaglandins and their receptors, cytokines and then* receptors, neurotransmitters and 
their receptors, adrenergic receptors, and cholinergic receptors. Receptors which could 
mteract with ligands include EGF, NGF, and PDGF receptors and leptin receptors. 
Analysis of EGF receptor dimerization, using the methods and compositions of the 
invention, is provided in Example 1 1 , infra. 

Additional interactions that can be studied by the practice of the invention include 
interactions involved in cell metabolism and cell structure. These mclude, but are not 
limited to, interactions that are involved in energy metabolism or which establish or 
modify the structure of the membranes, cytoplasm, cytoskeleton, organelles, nuclei, 
nuclear matrix or chromosomes of cells. Interactions among constituents of the 



extracellular matrix, or between constituents of the extracellular matrix and cells, can also 
be studied with the methods and compositions of the invention. 

The invention will be further understood by the following non-limiting examples. 

EXAMPLES 

Example 1 : Preparation and Transfection of Retroviral Construct Encoding a 3- 
Galactosidase Reporter System. 

A reporter system using p-galactosidase ("P-gal") complementation to evaluate 
protein-protein interactions was constructed. A well-characterized protein complex 
developed by Schreiber was used as a test system to provide the protein binding moieties. 
Belshaw, P. J. etal, Proc, Natl Acad Set USA, 93: 4604-4607 (1996); Brown et al., 
Nature 369: 756-758 (1994); Chen, etai, Proc. Natl, Acad ScL, USA, 92:4947-4951 
(1995); and Choi, J. et aL, Science, 273:239-242 (1996), the disclosures of which are 
incorporated herein. In this protein complex, the intracellular rapamycin binding protein, 
FK506-binding protein-12 (FKBP 12), interacts with intracellular FKBP-rapamycin ^ 
associated protein (FRAP) only when rapamycin is present in the culture medium, an 
interaction that increases with the dose of rapamycin. Rapamycin is a small, cell-permeable 
molecule that binds to the two intracellular proteins via independent determinants. Since 
rapamycin is unable to bind two FKBP 12 molecules at the same time and FRAP only 
bmds rapamycin within the FKBP 12-rapamycin complex, only heterodimers should form 
upon rapamycin treatment. Ho, S. N. et al. Nature, 382:822-826 (1996), the disclosure of 
which is incorporated herein. 

The p-gal system was combined with the FKBP 1 2/FRAP/rapamycin system as 
follows. Two different retroviral constructs were produced, each encoding fusion proteins 
of the A© or Aa p-gal mutants, and either FKBP 12 or the FKBP-rapamycin binding 
domain of FRAP, respectively (FKBP12-AcD and FRAP-Aa). 

The Aa or Aco p-gal mutants were obtained as described in Mohler and Blau, Proc. 
Natl Acad. 5cz,, 93:12423-12427 (1996), the disclosure of which is incorporated herein. 

To fuse the sequences coding for FKBP 12 and the FKBP 12-rapamycin binding 
domain in frame with p-gal, an adapter oligonucleotide (CATGGAGCTCGAGAG) 
containing an Xhol site was inserted in the Ncol site at the ATG of the Aa and A© p-gal 
mutants described by Mohler and Blau, supra. Two Xhol-Sall DNA fragments 



corresponding to amino acids 2025-21 14 of human FRAP and to the complete coding- 
sequence of human FKBP12 were cloned in the Xhol site of the modified Aa and Aco 
mutants, generating FRAP-Aa and FKBP12-Aa). Conservation of the appropriate reading 
fi-ame was confirmed by sequencing for both constructs. 

To insert the FRAP-Aa and FKBP12-Aco coding sequences in the pWZL-Neo and 
pWZL-Hygro retroviruses, an adapter oligonucleotide containing Ncol and BamHI sites 
(GATCACCATGGACGCGTGGATCCC) was inserted in the BamHI and Xhol sites of 
the pWZL vectors. Both the original sites were destroyed by this insertion. The FRAP-Aa 
and FKBP12-Aa) coding sequences were then inserted in the modified pWZL vectors as 
NcoI-BamHI fiagments. 

The cDNAs encoding FKBPU-A© and FRAP-Aa were inserted into a mouse 
ecotropic retroviral vector upstream of the hygromycin resistance or neomycin resistance 
genes, respectively, as described above. By using an Encephalomyocarditis virus internal 
ribosomal entry sequence (IRES), introduction of ai single retroviral vector ensured . 
production of a bicistronic mRNA and translation of both the Aa -p-gal-FRAP-protein and 
the drug selectable hygromycin protein. A second retroviral vector yielded the Ao-P-gal- 
FKBP 12 protein and neomycin resistance protein. 

For virus production and infection, proviral constructs were introduced into 
packaging cells by calcium phosphate transfection. The supernatant media containing 
retrovirus fi-om the packaging cells was harvested 24 to 72 hours after transfection and 
used to infect C2C12 cells in the presence of 8 ng/mL polybrene. Singly and doubly 
infected cells were selected with the appropriate drugs. Both Geneticin and Hygromycin 
were used at a final concentration of 1 mg/ml. The selected cells were expanded as 
populations for subsequent experiments. 

Although the background (J-gal detected with the Aa and A© mutants expressed 
fi-om MFG retroviral vectors described previously (Dhawan etal. Science, 254: 1509-1512 
(1991) was relatively low (Mohler, W. A., & Blau, H. M., Proc. Natl. Acad. Sci. USA, 
93: 12423-12427 (1 996), the disclosure of which is incorporated herein), it was fiirther 
reduced by using retroviral vectors with point mutations that deleted the splice 
donor/acceptor sequences upstream of the p-gal ATG (pWZL). These mutations result in a 
lower translation efficiency of the first coding sequence contained in the vector, but do not 
affect the expression of the selectable marker, which is solely dependent on the IRES. 



Using this vector, two-fold less of the upstream protein was expressed compared to vectors 
containing the same LTRs (long terminal repeats) and the wild-type splice donor/acceptor 
sequences. Such a reduction in translation reduces the concentration of the fusion protein 
and consequent spontaneous interactions of P-gal mutants irrespective of the test proteins 
to which they are fused. As a result, in preliminary experiments, the backgroimd enzyme 
activity measured by luminometer for Aa and Ao) p-gal mutants alone was reduced from 
low to essentially undetectable. 

Infectious viral particles were produced by transient transfection of each construct 
shown in Figure 2a into a packaging cell line modified from that described by Pear et al , 
(1993) Proc. Natl Acad. Scl USA 90:8392-8396 by calcium phosphate transfection. The 
supernatant media containing retrovirus from the packaging cells was harvested 24 to 72 
hours after transfection and used to infect C2C12 cells in the presence of 8 ^xg/mL 
polybrene. C2C12 myoblasts were infected either singly with each retrovirus alone or 
simultaneouisly with both. All experiments were performed after selection with 
hygromycin and G418 to ensure that 100% of the cells contained both constructs. Both 
Geneticin and hygromycin were used at a final concentration of Img/ml. The selected 
cells were expanded as populations for subsequent experiments. 

Example 2 : Assays of Binding and Activity of the p-Galactosidase Reporter 
System. 

Following the addition of rapamycin to the media, the transfected cells obtained as 
described in Example 1 were assayed for P-gal activity. As shown in Figure 3, C2C12cells 
expressing both FKBP12-Aa) and FRAP-Aa were tested by exposure to 10 ng/ml 
rapamycin (Figure 3b) for 12 hr or to no drug at all (Figure 3a). Only those cells 
expressing both constructs exhibited p-gal activity^ readily visualized by X-gal staining of 
fixed cells (Figwe 3b). It is advantageous that cytoplasmic staining is detectable with this 
method, in contrast to prior methods such as the yeast two-hybrid system, which report 
only nuclear interactions. X-gal staining was conducted as follows: Cells were fixed 5 
minutes in PBS plus 4% paraformaldehyde and rinsed in PBS prior to staining. 
Indigogenic X-gal staining was performed overnight at 37°C in PBS plus 1 mg/mL X-gal, 
1 mM MgCl2, 5 mM K3Fe(CN)6, 5 mM K4Fe(CN)6. 

The kinetics of activation of P-gal upon rapamycin treatment were determined. 
C2C12 cells expressing both fiision proteins were plated in replicate in 96 well plates. 



Rapamycin was added to the culture medium, and the p-gal activity measured at different 
time points. For each tune point, six replicate samples were assayed with a sensitive 
chemiluminescence assay, as described in Mohler, W. A., & Blau, H. M., Proc. Natl. 
Acad. Set, USA, 93:12423-12427 (1996), the disclosure of which is incorporated herein. 
In the assay, cells cultured in microtiter plates were lysed in situ m 50 of a 1 :3 mixture 
of lysis and assay buffers containing Galacton Plus substrate from the Galactolight Plus 
assay kit (Tropix, Bedford, MA). Reactions proceeded for 1 hour at room temperature. 
After addition of Light Emission Accelerator solution, luminescence was measured in a 
scintillation counter. 

The results, shown in Figure 4, indicate that the interaction assays for the fiision 
proteins are specific, and exhibit similar kinetics and a comparable dose-response curye to 
assays of the FKBP12/FRAP/rapamycin protein complex alone. Ho, S. N. et al. Nature, 
382:822-826 (1996). Rapamycin induced a 30-fold increase in p-gal activity within 5 
hours. As a control, no rapainypin was added, and no prgal activity was detected above 
background. As a, second control, in cell population? expressing only one of the two 
constructs, p-gal activity did not increase above background when r^amycm was added. 

In Figure 4b, the dose response curve is shown. The activation of P-gal was 
dependent on the dose of rapamycin, which appeared linear over a range of 0 to 10 ng/ml 
of the drug. This linearity provides support that p-gal enzymatic activity can serve as a 
reporter to quantitate protein-protein interactions. The close approximation of both the 
dose response and the kinetics to that observed by others (Ho, S. N. al , Nature, 
382:822-826 (1996)) suggests that the fusion to p-gal peptides is not interfering with the 
interaction of the FKBP12 and FRAP proteins. Moreover, endogenous FKBP12 and 
FRAP proteins are ubiquitously expressed and are capable of interacting with each other or 
with the fusion proteins in the presence of rapamycin, without generating p-gal activity. 
Detection of p-gal activity, as shown above, indicates that productive FRAP-Aa and 
FKBP12-A(D dimers will form in a cellular environment containing competing endogenous 
proteins, and that the resultant p-gal activity reflects the interaction of FRAP and FKBP12- 
rapamycin Thus, the p-gal fusion proteins can be used to monitor the interaction of 
proteins in the FKBP12/FRAP/rapamycin complex and in other types of multiprotein 
complexes. 



It is also possible to detect and quantitate binding activity in cell lysates. As shown 
in Figure 5, cells expressing both FKBP12-Ao) and FRAP-Aa fusion proteins were 
expanded in the absence of rapamycin and lysed. 100 ng/ml rapamycin was added to half 
of the samples, and the p-gal activity in the treated and untreated lysates was determined 
immediately (white bar), after one hour (black bar) or after 3 hours (gray bar). A greater 
than two-fold increase in p-gal activity was observed in the rapamycin-treated lysates one 
hour after administration of the drag. In control lysates that were not exposed to 
rapamycin, no statistically significant increase in p-gal activity was detected. The ability 
to detect and quantitate protein-protein interactions in cell lysates using the methods and 
compositions of the invention indicates that interactions between mature, fiilly-folded 
proteins can be detected and quantitated; co-translational assembly of complexes in not 
required for formation of complexes that can be monitored by p-gal activity. 

Example 3 : Tripartite fiisions for the quantitation of protein-protein interactions. 

To pemiit protein interactions to be studied in a quantitative manner in the system 
described in the above Examples and to control for effects on either the binding ability of 
the binding moiety or the complementing ability of the reporter subunits resulting from 
both activities being present in a single fusion protein, additional modifications were made 
to monitor the expression of the components of the system. In the above described system, 
the P-gal fusion proteins will be expressed from the same viral promoter, however, for 
some proteins, it is possible that their expression level will be influenced by the specific 
fiision partner. In particular, some proteins or domains could affect the stability or 
conformation of the p-gal domain. As a result, differences in the ability of the test proteins 
(the putative binding moieties) to complement one another could be observed that are not 
based on a physiological mechanism. 

In order to avoid these problems, fusions containing three components (p-gal 
mutant, FKBP12 or FRAP, and the test protein) were constructed. The most N-terminal 
component is the test protein, followed by FKBP12-Aco or FRAP-Aa (see the exemplary 
system in Figure 2b, where the test protein portions of the fusion are indicated by X and 
X'). The presence of the FKBP12 and FRAP portions allows raparaycin-mediated 
dimerization of these fusions, and the efficiency of P-gal complementation in the presence 
of rapamycin appears to be dependent on the FKBP12/FRAP/rapamycin interaction. The 
absolute values of p-gal activity obtained by simple coexpression (in the absence of 



rapamycin) of fusions containing a fixed protein of interest and different interacting ; 
partners was determined. In parallel samples, p-gal activity was measured upon induction 
of complementation with a fixed amount of rapamycin. The ratio between the P-gal 
activity obtained in the absence or in the presence of r^amycin indicated the relative 
ability of the different protein pairs to interact with each other. An added advantage of this 
approach is that the presence of the FKBP12 and FRAP domains provide a flexible hinge 
between the p-gal mutants and the putative binding moieties that are being analyzed. This 
reduces the possibility of interference between p-gal and the proteins of interest. 
Furthermore, it allows direct testing of the functional integrity of the p-gal components in 
the fusions without the need for recloning into more efficient viral vectors. 

Results were obtained with tetR-FKBP12-AQ) or tetR-FRAP-Aa tripartite fusions 
(see example in Figure 2b). Coexpression of these constructs, in which dimerization is 
driven by the tetracycline repressor (tetR) protein (Hinrichs, W. et al. , Science, 264:418- 
420 (1^94), the disclosure of which is incorporated herein), readily yielded P-gal positive 
cells. This indicates that fimctipnal tripartite fusions can be constructed, in which the ' 
dimerization of the most NTterminal peptide component efficiently drives complementation 
of the C-terminal p-gal deletion mutant polypeptides. 

Example 4: Dimerizati on of myogenic regulators using complementinp p-p al 
fusion proteins 

The p-gal complementation system is used to assay for the dimerization and 
nuclear translocation of HLH proteins (helix-loop-helix proteins, Murre et al. (1989) Cell 
56:777-783) including activators of muscle-specific proteins (myoD, myogenin, myfS, 
MRF-4), inhibitors of myogenesis (Id, Mtwist, I-mf) and ubiquitous E2A-type proteins 
(E47, E12, HEB). 

In a first step, a myoD-Aa-p-gal (myoD-Aa) fusion construct and a E12-A(D-P-gal 
(E12-AQ)) fusion construct are engineered in selectable retroviral vectors, as described 
above for FRAP-Aa and FKBP12-Aq). The two constructs are transduced into C2C12 
myoblasts. Following selection with the appropriate drugs for cells which express both 
constructs, p-gal activity is quantitated using the chemiluminescent assay described above. 
P-gal activity indicates that heterodimerization of the fiision proteins is occurring in this 
cell type. If p-gal activity is detected, individual cells are analyzed using a fluorescent X- 
gal stain in order to determine if the heterodimers are present in the nucleus. Since wild- 



type p-gal can be specifically directed to and detected in the nucleus by inclusion of a 
nuclear localization sequence (nls) (Hughes and Blau, Nature, 345:350-352 (1990)), 
activity fi-om the P-gal hybrid protein may be detected in the nucleus. Knowledge of the 
site of localization in the cytoplasm or nucleus will aid in assessing the function of the 
protein interactions, e.g. sequestration and inhibiting activity, or promoting activity. This 
method permits visualization of fluorescent markers of myogenesis, such as desmin, and 
creatine kmase, in correlation with the localization of P-gal, using the sensitive Fluor-X- 
Gal substrate described above (Mohler, W. A., & Blau, H. M,, Proc. Natl Acad ScL, 
USA, 93:12423-12427 (1996)). 

All fusion constmcts between myogenic regulators and complementing P-gal 
mutants described in the following sections may be tested in a muscle cell where 
heterodimerization of the endogenous myogenic regulator is known to occur. In addition, 
the followmg controls also may be performed. The myoD-Aa construct may be 
contransduced into the cell with FKBP12-A(d, and the E12-Aq> cbnstmct may be 
cotransduced with FRAP-Aa. This combination of constructs should result in no p-gal 
activity, unless some unusual mechanism exists in the particular cell type being tested that 
enhances complementation of the weakly complementing p-gal peptides independent of 
heterodimerization of the non-p-gal parts of the molecule. The FRAP-Aa and FKBP12- 
Ag) may also be cotransduced and cells treated with rapamycin as a positive control for 
complementation in each cell type. Cells in high serum medium (growth medium) and 
cells in low serum medium (differentiation medium) should/will give different results. 

Example 5: In vivo assay for the effect of growth factors and substrates on 
heterodimerization and homodimerization. 

Using the constructs described above in Example 4, C2C12 myoblasts are 
transduced with one of the myogenic HLH fusion constructs and the El 2- A© construct. 
Although C2C12 cells will already contain endogenous myogenic HLH proteins and El 2, 
the chimeric constructs will act as a "tracer" to measure the extent of heterodimerization. 
Transduced cells then may be stimulated to either differentiate or proliferate by changes in 
serum levels or the addition of growth factors (TGF-p, bFGF, IGF-I and IGF-II) in the 
presence or absence of substrates such as fibronectin or laminin. p-gal activity then is 
measured as a function of time. Rapid changes in p-gal activity after growth factor 
stimulation may suggest a more direct mechanism of action of a given extracellular signal 



on the fomiation of specific heterodimers. Slower changes may indicate that the ■ ■ 
extracellular signal acts indirectly, for example by up-regulating the expression of a 
competing factor which can sequester one or both fusion proteins. Changes in P-gal 
activity may be correlated with the expression levels of known inhibitors of differentiation 
such as Id proteins, measured by Northern blot in parallel samples. A comparison of the 
kinetics of changes in p-gal activity obtained with each pair of test proteins in parallel 
experiments will mdicate whether specific MRFs (muscle regulatory factors, Yun et al. 
(1996) Curr. Opin. Cell Biol. 8:877-879; and Cossu et al. (1996) Trends Genet.. 12:218- 
223) or inhibitors differ in their ability to respond to extracellular signals. When a growth 
factor or substrate capable of influencing heterodimer formation (or nuclear translocation) 
is identified, the experiments are repeated in other, non-myogenic cell types. The analysis 
of the effect of a specific growth factor in different cell types indicates whether the 
intracellular components of the corresponding signal transduction pathway are tissue- 
specific. 

These studies in tissue culture cells permit the relative affinity and 
compartmentalization of specific protein partners under conditions of growth and 
differentiation, and subsequentiy in response to known signal ti^sducers, to be evaluated. 
The interactions of these factors may be tested in a relevant physiological background in 
competition with the prevalent endogenous components present in the cell at the time. 
Most analyses of the interactions of myogenic factors performed thus far have been carried 
out in vitro, in purified systems, or in yeast (Benezra et al. Cell, 61:1213-1230 (1990); 
Lassar era/., Ce//, 66:305-315 (1991); Hue/ fl/.,M)/. Cell. 5/o/., 12:1031-1042 (1992); 
Chen et al. Cell, 86:731-741 (1996); and Spicer et al. Science, 272:1476-1480 (1996). 
The relatively low sensitivity of flie biochemical methods used to directiy detect 
interactions in mammalian cells, such as immunoprecipitation or activation of a reporter 
gene construct, required high levels of protein and overexpression of the constiuct, usually 
obtained by ti-ansient transfection, levels that could potentially force an interaction due to 
increased concentration. The methods disclosed herein permit protein-protein interactions 
tiiat are functionally relevant at different points in tiie myogenic differentiation pathway to 
be studied. Clearly, the extracellular and intiracellular milieu determines the stoichiometry 
and abundance of the these proteins at different times. As a result, competition of different 
proteins for the same dimerization partners, cofactors, and kinases or phosphatases in 



signal transduction pathways could have significant effects on which complexes actually 
form in intact cells. To assess the nature of such endogenous interactions, low expression 
levels are needed in order not to alter the levels inherent to the cell and characteristic of the 
'^competitive" environment at a given time. Advantageously, high-level expression of the 
introduced proteins is not required in the systems described herein in order to assess the 
protein-protein interactions of interest. Indeed, by contrast with transient transfection 
assays or even most retroviral vectors with strong promoters and high translation 
efficiencies, the systems disclosed herein provide levels that should not perturb the natural 
endogenous physiological levels of the proposed test proteins in the cell. 

Example 6 : Analysis of inhibitory and myogenic HLH proteins in mice. 

The heterodimerization of inhibitory and myogenic HLH proteins in mice may be 
mapped. Mtwist and I-mf have been shown to inhibit myogenesis in mammaUan tissue 
culture systems. In addition, they have been proposed to act via direct physical association 
with myogenic HLH proteins (Hebrok et al. Dev. Biol, 165:537-544 (1994); Rohwedel 
aU Exp. Cell Res., 220:92-100 (1995); Ghen et al, Ce//, 86:731-741 (1996); Spicer et ah, 
iSc/ewce, 272:1476-1480 (1996)). During embryogenesis, Mtwist is expressed throughout 
the epithelial somite and is later excluded from the myotome (Fuchtbauer, Dev. Dyn., 
204:316-322 (1995); and Stoetzel etal, Mech Dev, 51:251-263 (1995)). Although I-mf 
expression has not been analyzed at early stages of somitogenesis, at 1 1 .5 days post-coitum 
I-mf is highly expressed in the sclerotome but is excluded from the myotome (Chen et al. 
Cell, 86:731-741 (1996)). Thus, based on their expression domains in the embryo, these 
factors are thought to be critical for spatial and temporal restriction of the myogenic 
program in early development. 

Further support for this hypothesis derives from analyses of myf5//acZ embryos in 
which the myfS coding region has been targeted and replaced by lacZ. Using p-gal as a 
marker of the myfS expression pattern, cells expressing myf5 are detected in the presomitic 
mesoderm, where Mtwist is also expressed (Fuchtbauer, Dev, Dyn., 204:316-322 (1995); 
and Stoetzel et al, Mech Dev. 52:251-263 (1995)), long before the onset of myogenesis 
(Cossu et al. Trends Genet., ^2:21 8-223 (1996)). Later m development, myfS and myoD 
are co-expressed together with Mtwist in the somite before the formation of a distinct 
myotome. Ott, et al. Development, UV. 1097-1 107 (1991); Fuchtbauer, Dev. Dyn., 
204:316-322 (1995); Stoetzel etal, Mech Dev. 51:251-263 (1995); and Cossu et al. 



Trends Genet., 12:218-223 (1996)). These cells do not express other detectable myogenic 
markers (Ott, et al., 1991). Thus, the reporter systems disclosed herein may be used to 
determine if the myfS and MyoD proteins in these cells are maintained in an inactive state 
by interaction with Mtwist and/or I-mf in heteroduners. At subsequent stages of 
development, Mtwist and I-mf are expressed in most of the non-myogenic mesoderm, 
where the expression of myogenic factors is excluded. Smith et al., J. Cell Biol , 127:95- 
105 (1994); Fuchtbauer, Dev. Dyn., 204:316-322 (1995); Stoetzel etal.. Meek Dev. 
51:251-263 (1995); and Chen etal.. Cell, 86:731-741 (1996). Possibly Mtwist and I-mf 
are involved in the creation of a sharp border between the myotome and the adjacent 
tissues at this stage. 

The reporter systems disclosed herein permit detailed studies of the interactions 
between myogenic inhibitors and activators in vivo during embryonic development which 
can provide novel insights into the complex process of patterning during somitogenesis. 
Such studies are iiot limited to mice and can easily be performed in C. elegans, 
Drosophila, Xenopus, zebrafish and otiier experimental organisms. To date, a 
methodology that allows yisu^ization of protein complexes in situ in the embryo has not 
been available. As a result, no definitive evidence is available as to when and. where 
during embryonic development interactions of such HLH het^rodimers might occur. 
Example 7: Detection of HLH heterodimers in mouse embryos 
The p-gal complementation assay is well-suited for the detection of protein-protein 
interactions in vivo. Myf5-Aa, MyoD-Aa and Mtwist-Aco fusion proteins may be 
constructed. Mediation of p-gal complementation with these fusion proteins may be tested 
in the course of performing the experiments described above. Using well-established 
transgenic technology (Thomas and Capecchi, Nature, 324:34-38 (1986); and Capecchi, 
Science, 244: 1288-1292 (1989)), mouse lines may be generated in which one of the myfS, 
MyoD or Mtwist alleles has been replaced with the corresponding fusion protein. Thus 
myf5-Aa, MyoD-Aa and Mtwist-A© fusion proteins will be expressed under the control of 
their endogenous promoters. The expression of the test protein can be verified in these 
mice. The Mtwist-Aco transgenic mouse may then be crossed with the my£5-Aa, and the 
MyoD-Aa transgenic mouse lines, and in each case the offspring may be analyzed in order 
to identify those carrying both of the fusion proteins. P-gal activity should only develop in 
those cells of the embryo in which Mtwist-Ao) physically associates with the myf5-Aa or 



the MyoD-Aa fusion proteins. This analysis allows mapping when and where during 
embryonic development Mtwist is actually interacting with myfS and MyoD to repress the 
myogenic phenotype. 

Example 8: Targeting strategy and engineering of necessary constructs 
The myf5-Aa fusion protein coding sequence may be inserted into the myfS locus 
so that it will be expressed under the control of the endogenous myfS regulatory elements. 
A similar insertion of wild type p-gal in the myfS locus resulting in a fusion with the ATG 
of myfS has been shown to reproduce faithfully the expression pattern of the endogenous 
gene. The targeting construct is based on the published my fSllacZ targeting constmct 
(Tajbakhsh and Buckingham, Proc, Natl Acad Set USA, 91 :747-751 (1994); Tajbakhsh 
et al. Neuron, 13:813-821 (1994); and Tajbakhsh et al. Nature 384:266-270 (1996)), but 
with the following differences: (1) The fusion protein contains the complete myfS coding 
sequence fused to the Aa p-gal. (2) The fusion protein coding sequence is followed by a 
neomycin resistance: gene flanked by FRT sites (FLP recombinase targets). This allows 
G4 1 8 selection of ES cells that have taken up and integrated the targeting construct. (3)~A 
diphtheria toxin expression cassette is located 5' of the region of homology with the myf5 
mouse genomic DNA. During homologous recombination, strand exchange will occur 
within the homology region and as a result the diphtheria toxin expression cassette will be 
excluded following integration (Capecchi, Science, 244: 1288-1292 (1989)). Clones 
resulting from random integration rather than homologous recombination retain diphtheria 
toxin expression and will be selected against during culture, because they will die. The 
surviving clones are characterized by PGR, and the appropriate integration of the construct 
in the myfS genomic locus is confirmed by Southern blot. 

Subsequently, the neomycin selection cassette is removed using a modified version 
of a previously described technique (Fiering et al. Genes Dev., 9^22^2^-22X3 (1995)). 
Briefly, a plasmid expressing a bicistronic message containing FLP recombinase, an 
Internal Ribpsomal Entry Site (IRES) and GFP is transiently transfected into the ES cell 
clones. GFP positive cells are clonally sorted using the fluorescence activated cell sorter 
(FACS). In these cells, FLP deletes the sequences between the two FRT sites, and only the 
P-gal coding sequence remains in the ES cell genome. Aliquots of the sorted clones are 
tested for sensitivity to G418, and in the sensitive clones the accurate deletion of the 
neomycin cassette is confirmed by PGR and Southern blotting. This approach, which 



eliminates the selectable marker, avoids interference between the exogenous promoter * 
driving the selectable marker and the endogenous regulatory sequences as described 
(Olson etaL, Cell, 85:1-4 (1996)). 

Targeting constructs for MyoD and Mtwist have also been described (Rudnicki et 
al. Cell 71 :383-390 (1992); Chen and Behringer, Genes Devel, 9:686-699 (1995)) and 
the relevant constructs may be produced for each. Based on these available reagents, and 
follow^ing the scheme proposed above for the myf5-Aa strategy, vectors to target (Chen 
and Behringer, Genes Devel, 9:686-699 (1995)) MyoD-Aa and Mtwist-AcD fusions into 
the endogenous MyoD and Mtwist loci of ES cells may be constructed. In each case, an 
ES cell line syngeneic to the available genomic DNA homology regions in the targeting 
construct are used, as strain differences are known to reduce the frequency of homologous 
recombination. The same FLP-mediated excision methodology used for tiie myf5 "knock 
in" described above is applied to the deletion of the neomycin resistance markers from the 
targeted MypD and Mtv^st loci. This "in-put" strategy ensures that the fusion protein 
coding regions are under the control of the endogenous regulatory , elements and associated 
wdth minimal extraneous flanking DNA sequences. 

Example 9: Analysis of the mvf5-Aa/Mtwist-Aa) and 

MvoD-Aa/Mtwist-Ao) transgenic lines 
For each construct, multiple ES cell clones are injected into blastocysts. The 
chimeric offspring obtained upon implantation into pseudopregnant females are tested for 
germline transmission, and heterozygous mice are obtained. One critical control in this 
experiment is to confirm that the expression pattern of the "knocked-in" fusion proteins 
faithfully mimics that reported for the corresponding endogenous factors. For this 
purpose, a system allowing rapid detection of the fusion proteins is provided. A transgenic 
mouse strain expressing a P-gal mutant (A^i) capable of strong complementation with 
either Aa or A© is generated. A^i is expressed ubiquitously from the strong chicken P- 
actin promoter. MyoD-Ao, myf5-Aa and Mtwist-AcD transgenic mouse lines are each 
crossed with the A^ transgenic mice. Since co-expression of any of these fusion proteins 
with the strongly complementing A(i mutant should result in readily detectable P-gal 
activity, it is thus possible to follow the expression pattern of our fusion proteins by X-gal 
staining of the embryos. 



The Mtwist-Aco mouse line is crossed with MyoD-Aa and myfS-Aa transgenic 
mouse lines. As hetero2ygous mice are used for these crosses, on average 1/4 of the 
embryos will be double heterozygotes. These embryos are analyzed at different time 
points during development by staining whole mount preparations and histological sections 
with X-gal. The sections also are stained with the more sensitive Fluor-X-Gal fluorescent 
substrate (Mohler, W. A., & Blau, H. M., Proc, Natl Acad. Set, USA, 93:12423-12427 
(1996)), to detect those cells in which the Mtwist-MyoD or the Mtwist-myf5 interaction is 
a rare event and the p-gal signal is consequently lower. 

The strength of this approach is that p-gal activity should only appear in cells in 
which the interactions described above take place in vivo. This approach allows a 
thorough analysis of the interplay between inhibitors and activators of myogenesis during 
development In particular, it allows analysis of the correlation between co-expression and 
a physical interaction of two proteins as heterodimers in an in vivo setting, the developing 
mouse embryo. This is particularly important in the case of factors which, like Mtwist, are 
known to be involved in multiple control steps (Chen and Behringer, Genes Devel, 9:686- 
699 (1 995)) and are likely to carry out their functions through interaction with different 
determination factors. 

The use of P-gal complementation mutants also can be extended to an analysis of 
I-mf. I-mf has also been implicated as a negative regulator of myogenesis in the embryo 
(Chen et al. Cell, 86:731-741 (1996)). Interestingly however, I-mf and Mtwist are co- 
expressed throughout most of the somite. It is not clear if their presence in the same cells 
is merely an indication of the existence of redundant mechanisms for repressing the 
activity of the myogenic HLH regulators or whether the two factors have distinct 
functions. In the first case, the prediction would be that both I-mf and Mtwist associate 
with the same factors. In the second case, differences and interactions with different 
factors should be detectable using our experimental approach. 

Example 10: Analysis of protein interactions by Fluorescence-Activated 
Cell Sorting (FACS) 

The p-gal activity of a population of C2C12 cells that were coinfected with 
FRAP-Aa and FKBP12-Aa) (as described in Examples 1 and 2) was assayed in the 
presence and absence of 10 ng/ml rapamycin by FACS. FACS was carried out according 
to methods that are well-known in the art, e.g., Nolan et al (1988) Proc, Natl Acad Set 



USA 85:2603-2607. Using this assay, increased P-gal activity was detected in the majority 
of cells after 90 minutes of rapamycin treatment (Figure 6 A). A range of expression levels 
was observed, as evidenced by the breadth of the peak of emission in the presence and 
absence of the drug (compare light and dark profiles in Figure 6A). This breadth is 
presumably due to variable efficiency of expression of each of the retroviral vectors 
following integration in the target cell. This inference is supported by the finding that 
when the 25% of cells expressing the lowest P-gal activity in the absence of rapamycin 
were collected (Figure 6B) and reassayed in the presence and absence of rapamycin, the 
treated and untreated cell populations yield non-overlapping peaks by FACS analysis, 
mdicating a clear distinction between the treated (light peak) and untreated (dark peak) 
populations (Figure 6C). Thus, non-overlapping populations of cells distinguished by the 
expression (or non-expression) of complementing fiision proteins can be identified and 
isolated by FACS. 

Example 11 r . Mpnitoring of EGF Receptor Dimerization in Living Cells 
A previously unrecognized mode of regulation of the epidermal growth factor 
(EOF) receptor signaling pathway that acts through receptor dimerization was revealed 
using the methods of the invention for monitoring protein-protein interactions at the 
membrane of live cells. Chimeric proteins containmg the extracellular and transmembrane 
domains of the EGF receptor, fused to weakly complementing p-galactosidase (p-gal) 
deletion mutants, were expressed in myoblasts. Treatment of the cells with EGF resulted 
in chimeric receptor dimerization as assessed by a rapid increase in p-gal enzymatic 
activity. Further treatment with EGF did not restimulate dimerization unless an inhibitor 
of EGF receptor tyrosine kinase was added. These results reveal a feedback mechanism in 
which tyrosine kinase activity of the dimeric receptor inhibits further dimeri2ation of the 
receptor. 

Methods 

Construction of chimeric receptors. The weakly complementing Aa and 

A(o deletion mutants of p-gal were each linked to a polypeptide sequence containing the 
extracellular and transmembrane domains of the human EGF receptor to form chimeric 
receptor molecules. The chimeric receptors lacked the cytoplasmic domain, and attendant 
tyrosine kinase activity, of the native receptor. The procedure was as follows. The 
sequence coding for the extracellular and transmembrane domains of the human EGF 



receptor (amino acids 1-655) was amplified by polymerase chain reaction (PGR) using 
primers that incorporated an Ncol site at the 5' end and an Xhol site at the 3* end of the 
PGR product. Although this fragment retains threonine 654, which is a site of protein 
kinase G (PKC) phosphorylation, arginines 656 and 657 are removed, destroying the 
consensus PKG recognition sequence. The amino acid sequence beginning with threonine 
654 is thr-leu-glu-ser-met, with the met residue being the beginning of the p-gal sequence. 
The glu and ser codons are generated by the junction sequence and are not native to either 
EGForp-gal. 

DNAs encoding the chimeric receptors were inserted into a retroviral vector also 
encoding a selectable marker. For the construct containing the EOF receptor-Aa fusion, 
the selectable marker was the neo gene, encoding G418 resistance; while the EOF 
receptor-AcD fusion specified hygromycin resistance (Figure IB). Accordingly, the EOF 
receptor PGR product was digested and cloned into the Ncol and Xhol sites of the 
pWZL-Aa and pWZL-Aco vectors. The pWZL-Aa-neo and pWZL-Aco-hygro plasmids 
were constructed by cloning the /acZ Aa and A© deletion mutants into pWZL-neo and 
pWZL-hygro, respectively. Mohler aL^ supra; and Rossi et aL (1997) Proc. Natl Acad. 
Scl USA 94:8405-8410. Plasmids were transfected into (})NX cells using Lipofectamine 
(Life Technologies), and virus-containing supernatant was harvested 48-72 hours later. 
G2F3 mouse myoblasts (Rastinejad et al (1993) Cell 72:903-917) maintained in DME 
with 20% fetal bovine serum (FBS) in 10% GOj, were infected by overnight incubation in 
the viral supernatant. Gells containing both constructs were selected in 1 mg/ml G41 8 plus 
1 mg/md hygromycin, and were maintained in 400 |ag/ml of each antibiotic. 

EGF treatment and FACS analysis Gells were treated with mouse salivary 

gland EGF (Sigma) at 100 ng/ml and in some experiments were treated with tyrphostin 
AG1478 (Galbiochem) at 100 nM. Following all treatments, cells were rinsed with 
phosphate buffered saline (PBS), trypsinized, and resuspended in PBS + 5% FBS. 
Fluorescein di-p-D-galactopyranoside (FDG; Molecular Probes) was loaded into the cells 
by hypotonic shock as described. Fieringe/a/. (1991 )CvroAwerrv 12:29 1-3 01 and Nolan 
etal (1988) Proc. Natl. Acad ScL USA 85:2603-2607. Gells were kept on ice until 
analysis on the cell sorter, which was conducted 1 to 2 hours after trypsinization. 

The chimeric receptor was detected by immunofluorescence using a monoclonal 
mouse anti-human EGF receptor antibody diluted 1 :100 (clone EGFRl, Dako) and either 



phycoerythrin-labeled horse anti-mouse IgG (Vector) or fluorescein-labeled goat anti-: ' 
mouse IgG (Cappel) diluted 1 :100. Cells were tiypsinized and stained in PBS + 5% FBS. 
For each sample, FACS analysis data was collected for 5000 cells. Cells were cloned on a 
Becton-Dickinson FACS Vantage and analyzed on a Becton-Dickinson FACScan at the 
Stanford Shared FACS Facility. Data analysis was facilitated by FlowJo software (Tree 
Star, Inc.). Data shown here as FACS profiles were adjusted for autofluorescence using 
autofluorescence compensation. Albertie/ a/. (1987) Cytometry 8:114-119. Mean 
fluorescence data were adjusted for autofluorescence and for endogenous mammalian p- 
gal activity by subtractmg the mean fluorescence of untransduced cells loaded with FDG 
substrate. 

Results 

Receptor dimerization assay. The two chimeric DNAs were each cloned 

into retroviral vectors encoding selectable markers (Fig. 7B) and transduced into the C2F3 
mou^e myoblast cell line. After selection with G41 8 and hygromycin, P-gal enzyme • 
activity was monitored using the fluorescence activated cell sorter (FACS) to measure the 
cleavage product of a fluorogenic substrate. In the absence of EGF, the population of 
transduced cells consisted of a mixture of cells with low and high levels of p-gal activity 
(Fig. 7C, light gray curve), which was not unexpected given that the EGF receptor is 
capable of dimerizing in the absence of EGF. Gadella et al (1 995) J. Cell Biol 129: 1 543- 
1 558. FoUowmg stimulation of the population of cells with EGF many of the cells 
exhibited increased p-gal activity (Fig. 7C, dark gray curve). FACS analysis with an 
antibody specific to the human EGF receptor showed that the cells expressed a broad range 
of levels of the chimeric receptor (Fig. 7D, medium gray curve). Clones from this 
population were isolated and screened for low background leyels of P— gal activity in the 
absence of EGF, and increased levels of p-gal activity in the presence of EGF. One such 
clone had a low level of chimeric receptor expression relative to the population (Fig. 7D, 
dark gray curve) and exhibited a several-fold increase in p-gal activity in the presence of 
EGF (Fig. 7E), indicating dimerization of the chimeric receptor. Dimerization was also 
observed following stimulation with other EGF-like growth factors that bind and activate 
the EGF receptor, such as TGF-a, heparin-binding EGF-like growth factor, and 
betacellulin; but not with EGF-like factors, such as heregulin a, that act through related 
receptors other than the EGF receptor. BeerU et al. (1996) J. Biol. Chem. 271:6071-6076. 



Dimerization, expressed as the mean fluorescence or p-gal activity of the cells, could be 
detected with EGF treatments as short as one minute, and dimerization increased rapidly 
with longer exposure to EGF (Fig. 7F). 

Time-course of EGF Receptor dimerization. In order to follow the fate of 

receptor diraers over time, cells from the same clone described above were cultured in 
media containing EGF for 0 to 24 hours and then analyzed by FACS. Dimerization peaked 
after 2 to 4 hours in EGF, and then decreased (Fig. 8). The fold increase in dimerization 
and the rate of the ensuing decline in dimerization differed among experiments, but the 
overall pattern was consistent, and was also observed with the original population of 
uncloned cells. By contrast, measurement of the levels of the chimeric receptor on the cell 
surface by inmiimofluorescence using the FACS showed that the amount of chimeric 
receptor on the cell surface remained essentially constant over the period that dimerization 
markedly decreased (Fig. 8, dashed line). It was concluded that the decline in dimerization 
was due to either the depletion of EGF from the media, or to an inhibition of receptor 
dimerization. 

Feedback regulation of EGF Receptor dimerization During the decline in 

dimerization, the response to a second EGF treatment was minimal, suggesting that the 
cells were resistant to further EGF-mediated dimerization despite the continued presence 
of the chimeric receptor on the cell surface. By contrast, if, following EGF treatment, cells 
were incubated in media lacking EGF for several hours, dimerization could be restimulated 
with a second treatment of EGF. This indicated that the continued presence of EGF in the 
media was the basis for the continued inhibition of dimerization of the receptor, A 
possible explanation for these results is that signaling through the endogenous wild-type 
EGF receptors in the cells inhibits dimerization of the chimeric receptor. A test of this 
hypothesis was possible, using AGMTS, a highly specific inhibitor of the EGF receptor 
tyrosine kinase, Levitzki et al (1995) Science 267:1782-1788. 

Accordingly, cells expressing chimeric receptor were treated with EGF overnight, 
and then retreated with EGF or tyrphostin. As shown in Figure 9A (left panel), sample I 
received a single overnight treatment with 100 ng/ml EGF. Samples II and III also were 
treated with EGF overnight, and then retreated with 100 ng/ml EGF for 2 hours (sample 
II), or 100 nM tyrphostin AG1478 for 2 hours (sample III). Sample IV received a single 2 
hour treatment with 100 ng/ml EGF, and sample V received no treatment. The results 



(Figure 9A, right panel) show that treatment of the cells with tyrphostin led to an increase' 
in dimerization, yielding dimerization levels that were comparable to the peak levels 
observed after a single two hour treatment with EGF, indicating that EGF receptor tyrosine 
kinase activity is involved in inhibiting receptor dimerization. Tyrphostin treatment also 
caused an increase in the amount of p-gal activity observed when previously unstimulated 
cells were treated with EGF. Cells were treated with EGF and tyrphostin, or EGF alone, 
over periods ranging from 0-24 hours. Cells that received both tyrphostin and EGF 
showed greater p-gal activity than cells that received EGF alone, for treatment times of up 
to 6 hours (Fig. 9B). By 8 hours of treatment, there was no difference in EGF receptor 
dimerization between EGF-treated cells and EGF+tyrphostin-treated cells. Repeated 
administration of tyrphostin every four hours did not further prolong the increased P-gal 
activity. 

These results show that inhibition of receptor tyrosine kinase can relieve a feedback 
mhibition of receptor dimerization. Protein kinase C, phosphorylation can decrease 
receptor binding affmity for EGF. by phosphorylating the receptor on sites in the - 
cytoplasmic domaiii. Hpweyer, since the chimeric receptor used in the experiments 
described herein lacks the known sites of PKC phosphorylation, the inhibition of 
dimerization observed with this receptor must be mediated through the extracellular or 
transmembrane regions of the receptor. 

These results also demonstrate that, using the methods and compositions of the 
invention, it is possible to monitor EGF receptor dimerization in live cells. They show, in 
addition, that receptor kinase activity is involved in regulating dimerization, the first step 
after ligand binding in EGF signal transduction. Dimerization is measurable following 
treatment of cells with EGF after as little as one minute, which indicates that the p-gal 
complementation is able to monitor the rapid production of newly formed protein dimers. 
Previous data on EGF binding, receptor intemalization, and substrate phosphorylation also 
indicate that the receptor responds to ligand within minutes. Felder etal. (1992) J. Cell. 
Biol. 117:203-2 12; and Kiyokawa et al. (1 997) J. Biol. Chem. 272: 1 8656-1 8665. 
Although receptor dimerization declines after a few hours, the chimeric receptor remains 
on the cell surface and is refractory to further dimerization in response to EGF. Inhibition 
of the endogenous receptor tyrosuie kinase, however, permits further dimerization. 
Inhibition of receptor dimerization begins immediately following receptor activation, as 



shown by the observation that including tyrphostin with the initial EGF treatment increases 
dimerization over the levels observed with EGF alone. 

The kinetics of complementation reflect the kinetics of association of the binding 
partners The decline in EGF receptor dimerization is in contrast to 

observations using p-gal complementation to monitor the interaction of FRAP and 
FKBP12. See Examples 1, 2 and 10, supra\ see also Rossi et al (1997), supra. Using 
p-gal complementation to detect the rapamycin-mediated interaction between FRAP and 
FKBP12, the slowest increase in p-gal activity was seen at the earliest time points 
following the addition of rapamycin, but p-gal activity continued to increase for at least 20 
hours. This could be due to stabilization of the chimeric protein interactions by formation 
of the active P-gal complex. With EGF receptor dimerization, however, the most rapid 
increase in P-gal activity was seen at the earliest time points after the addition of EGF to 
the media; whereas, after 2 to 4 hours, the p-gal activity declined. The difference between 
these results indicates that the dimerization kinetics observed with p-gal complementation 
are not simply a reflection of p-gal complementation kinetics or stabilization, but reflect, at 
least to some degree^ the kinetics of interaction of the non-p-gal portions of the chimeric 
proteins. The results also show that p-gal complementation can monitor the regulation of 
dimerization by other proteins. 

Comparison to previous methods Receptor dimerization has typically 

been studied by in vitro methods such as chemical cross-linking and immunopurification, 
followed by gel electrophoresis. Yarden et al (1987) Biochemistry 26: 1443-1451. 
Recently, EGF receptor dimerization has also been analyzed by fluorescence resonance 
energy transfer (FRET). Gadella et al (1995) supra. Fluorescein and rhodamine labeled 
EGF was added to cells, and dimerization of the receptor was measured microscopically. 
Low temperature incubations and fixation of the cells was required to prevent 
intemahzation of the receptor before analysis, a problem that was avoided in the present 
experiments by using a non-intemalizing mutant receptor. FRET can also be used to study 
interactions of fluorescently-labeled molecules within the cell or cell membrane; however, 
labeling and introduction of these molecules at sufficiently high concentration can be 
cumbersome. It has recently been shown that green fluorescent protein can be modified 
and used for FRET analysis on genetically expressed proteins. Miyawaki et al (1997) 



Nature 388:882-887. The GFP signal, however, cannot be en2ymatically ampUfied aS is'' 
the case with p-gal. 

Thus, p-gal complementation provides a rapid method for monitoring receptor 
dimerization in live cells. This method can be used for high throughput screening for 
pharmacological agents that can bind to a number of receptors and act as either agonists or 
aitagonists. Binding data alone cannot indicate whether or not an agent can elicit a 
response; identifying a response, by analysis of downstream effects such as 
phosphorylation, involves destruction of the cells followed by in vitro analysis, p-gal 
complementation will also enable a screen for novel dimerization partners in a mammaUan 
"two-hybrid" assay that, in the case of membrane receptors, can offer new insight into the 
regulation of signal transduction pathways. 

Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity of understanding, it will be apparent to 
those skilled in the art that certain changes and modifications may be practiced. Therefore 
the foregoing descriptions and examples should not be construed as limiting the. scope of ~ 
the invention. 



CLAIMS 

What is claimed is: 

1 . A reporter system component comprising: 

a first low-afifinity reporter subunit, coupled to a first putative binding moiety; 

wherein the first low-affinity reporter subunit is capable of association with at least 
a second low-affinity reporter subimit to generate a detectable signal, said association 
being mediated by the first putative binding moiety. 

2. The reporter system component of claim 1 wherein the first putative binding 
moiety is a protein. 

3 . The reporter system component of claim 2 wherein the protein is selected 
firom the group consisting of members of a signal transduction cascade^ cell surface 
receptors, proteins regulating apoptdsis, proteins that regulate progression of the cell-cycle, 
proteins involved in the development of tumors, transcriptional-regulatory proteins, 
translational regulatory proteins, proteins that affect cell interactions, cell adhesion 
molecules, proteins which are members of ligand-receptor pairs, proteins that participate in 
the folding of other proteins, and proteins involved in targeting to intracellular 
compartments. 

4. A reporter system comprising the reporter system component of claim 1 and 
fiirther comprising at least a second low-affinity reporter subunit coupled to a second 
putative binding partner of the first putative binding moiety. 

5. The reporter system of claim 4 wherein the binding affinity of the putative 
binding moieties for each other is greater than the binding affinity of the first and second 
reporter subunits for each other. 

6. The reporter system of claim 5 wherein the production of the signal is 
dependent upon the binding of the putative binding moieties. 



7. The reporter system of claim 4 wherein the first and second putative ; " 
binding moieties are proteins. 

8. The reporter system of claim 7 wherein the protein is selected from the 
group consisting of members of a signal transduction cascade, cell surface receptors, 
proteins regulating apoptosis, proteins that regulate progression of the cell-cycle, proteins 
involved in the development of tumors, transcriptional-regulatory proteins, translational 
regulatory proteins, proteins that affect cell mteractions, cell adhesion molecules, proteins 
which are members of ligand-receptor pairs, proteins that participate in the folding of other 
proteins, and proteins involved in targeting to intracellular compartments. 

9. The reporter system of claim 7, wherein the first and second reporter 
subunits each comprise a protein, and wherein said proteins are capable of associating to 
catalyze a reaction to produce a detectable signal. 

1 0. The reporter system of claim 9 wherein the associated reporter subunits 
catalyze a reaction to produce a product which is directly detectable as the detectable 
signal. 

1 1 . The reporter system of claim 7 wherein the first and second subunits are 
low affinity binding mutant subunits of a hydrolytic enzyme. 

12. The reporter system of claim 1 1 wherein the first and second subunits are 
low affinity binding mutant subunits of p-galactosidase. 

13. The reporter system component of claim 1 wherein said component 
comprises a fiision protein including said low affinity reporter subunit and said first 
putative binding moiety. 

14. The reporter system component of claim 1 3 wherein the low affinity 
reporter subunit comprises a low affmity binding mutant subunit of P-galactosidase. 



15. A nucleic acid encoding the fusion protein of claim 1 3 . 



16. The nucleic acid of claim 15 further comprising regulatory sequences 
effecting expression of the putative binding protein. 

17. A viral vector comprising the nucleic acid of claim 1 5 . 

18. A cell transformed with the nucleic acid of claim 1 5. 

19. A cell transformed with a nucleic acid encoding the reporter system 
component of claim 13 and a nucleic acid encoding at least a second of said reporter 
system components. 

20. The reporter system component of claim 14 wherein the fusion protein 
further comprises an additional protein sequence between said reporter subunit and said 
putative binding moiety. 

21. A method of determining the occurrence of binding between first and 
second putative binding moieties, the method comprising: 

a) providing a reporter system comprising: 

a first component comprising a first low affinity reporter subunit, 
coupled to the first putative binding moiety, and 

a second component comprising a second low affinity reporter 
subunit coupled to the second putative binding moiety; 

wherein the first low affinity reporter subxmit is capable of association with 
at least the second low affinity reporter subunit to generate a detectable signal, said 
association being mediated by the binding of the first and second putative binding 
moieties; 

b) combining the first component and the second component; and 

c) detecting the presence or absence of the signal 



22. The method of claim 21 wherein the binding affinity of the putative biiidiii'g 
moieties for each other is greater than the binding affinity of the first and second reporter 
subunits for each other. 

23. The method of claim 21 wherein the first and second putative binding 
moieties are proteins. 

24. The method of claun 23 wherein the protein is selected fi-om the group 
consisting of members of a signal transduction cascade, cell surface receptors, proteins 
regulating apoptosis, proteins that regulate progression of the cell-cycle, proteins involved 
in the development of tumors, transcriptional-regulatory proteins, translational regulatory 
proteins, proteins that affect cell interactions, cell adhesion molecules, proteins which are 
members of ligand-receptor pairs, proteins that participate in the folding of other proteins, 
and proteins involved in targeting to intracellular compartments. 

25. . The method of claim 21, wherein the first and second reporter subimits each 
comprise a protein, and wherein said proteins are capable of associating to catalyze a 
reaction to produce a detectable signal. 

26. The method of claim 25 wherein the associated reporter subunits catalyze a 
reaction to produce a product which is directly detectable as the detectable signal. 

27. The method of claim 26 wherein the first and second subunits are low 
affinity binding mutant subunits of p-galactosidase. 

28. The method of claim 2 1 wherem each of said first and second components 
comprises a fiision protein. 

29. The method of claim 28 wherein the low affinity reporter subimit comprises 
a low affinity binding mutant subunit of p-galactosidase. 



30. The method of claim 28 wherein step (a) comprises transforming a cell with 
one or more nucleic acids encoding the fusion proteins. 

3 1 . The method of claim 30 wherein step (c) comprises detecting the signal 
5 within the cell. 

32. The method of claim 30 wherein the one or more nucleic acids encoding the 
fusion proteins further comprise sequences regulating expression of the putative binding 
protein. 

10 

33. The method of claim 30 wherein the fusion proteins are encoded by a viral 

vector. 

34. The method of claim 28 wherein the fusion protein further comprises a 
15 " protem sequence between said reporter subunit and said putative binding moiety. 

35. The method of claim 2 1 wherein the degree of binding is quantitated. 

36. The method of claim 2 1 wherein the method further comprises detecting the 
20 effect of a third moiety on the binding of the first and second binding moieties, the method 

further comprismg, after step (a) and prior to step (b), combining said reporter system with 
said third moiety. 

37. The method of claim 21 wherein the method further comprises determining 
25 potential agonist or antagonist activity of said third moiety. 

38. The method of claim 3 1 wherein the intracellular localization of the signal 
is determined. 



30 



39. The method of claim 21 wherein step (b) comprises combining the first and 
second components in the presence of a substance to determine the effect of the substance 
on binding of the first and second binding moieties. 



40. The method of claim 39 wherein the substance is a putative binding 
inhibitor of binding moieties having a predetermined binding affinity, and wherein the 
absence of the signal in step (c) provides an indicator that the substance is a binding 
inhibitor. 

4 1 . The method of claim 3 9 wherein the substance is a putative promoter of 
binding between binding moieties having low or substantially no binding affinity for each 
other, and wherein the presence of the signal in step (c) provides an indicator that the 
substance is a promoter of binding of the binding moieties. 

42. The method of claim 39 wherein the first and second reporter units, and first 
and second binding moieties, each are proteins; 

wherein the components provided |n step (a) each comprise a fusion protein 
. including the reporter subunit and the binding moiety; : 

wherein step (b) comprises expressing nucleic acid sequences encoding the first 
and second components within a cell suspected to contain the substance which inhibits or 
promotes binding of the binding moieties; and 

wherein step (c) comprises detecting the presence or absence of the signal in the 
cell or lysate thereof, thereby to determine the presence or absence in the cell of the 
substance which acts as an inhibitor or promoter of binding between the binding moieties. 

43. The method of claim 39 wherein the substance is selected from the group 
consisting of a protein, lipid, carbohydrate, nucleic acid and a small molecule 
pharmaceutical. 

44. A method of screening for bmding of a first binding moiety with members 
of a plurality of different second putative binding moieties, the method comprising: 

a) providing a plurality of reporter systems each comprising: 

a first component comprising a first low affinity reporter subunit coupled to 
the first binding moiety, and 



one of a plurality of second components each comprising a second low 
affinity reporter subunit coupled to one of said plurality of second putative binding 
moieties, wherein in each of said second components, said second putative binding 
moiety is different; 

wherein the first low affinity reporter subunit is capable of association with 
the second low affinity reporter subunit to generate a detectable signal upon the 
binding of the first binding moiety with one of said different second putative 
binding moieties; 

b) individually combining the first component with each of the plurality of 
second components to produce a plurality of binding assay samples, each of which 
includes the first component and a different one of the second components; and 

c) detecting the presence or absence of the signal in each of the binding assay 
samples. 

45. The method of claim 44 wherein the first and second components each 
comprise a fusion protein including the binding moiety and the reporter subtmit. 

46. The method of claun 45 wherein, in step (b), the components are expressed 
fi-om a nucleic acid sequence introduced into a cell. 

47. The method of claim 46, wherein the plurality of second putative binding 
moieties are encoded by members of a cDNA library. 

48. The method of claim 47, wherein the cell is a eukaryotic cell. 

49. The method of claim 48, wherein the cell is a mammalian cell. 

50. The method of claim 49, wherein the cell is a human cell. 



51. 



The method of claim 44, wherein, in step (c), the signal is quantitated. 



52. The method of claim 44, wherein cells in which binding between the first' ' 
binding moiety and one of the plurality of putative second binding moieties has occurred 
are separated from cells in which said binding has not occurred. 

53. The method of clairai 52, wherein separation is by fluorescence-activated 
cell sorting. 

54. The method of claim 44, wherein the first binding moiety is selected from 
the group consisting of cell surface receptors, transcriptional regulatory proteins, 
translational regulatory proteins, replication proteins, splicing proteins, signal transduction 
proteins, cell-cell adhesion molecules, cell-substrate adhesion molecules, cell-cycle 
proteins, oncogene products, tumor suppressor proteins, membrane receptors, proteins 
regulating apoptosis, developmental regulatory proteins, proteins that affect cell 
interactions, proteins that participate in the folding of other proteins, proteins involved in 
targeting to intracellular compartments, viral proteins and cytoskeletal proteins. 

55. The method of claim 39 wherein the substance is a peptide, drug or 
synthetic analog. 

56. The reporter system of claim 4 wherein the first putative binding moiety 
and the second putative binding partner comprise the same molecule. 

57. A method of determining the occurrence of association between first and 
second moieties, the method comprising: 

a) providing a reporter system comprising: 

a first component comprising a first low affinity reporter subimit, 
coupled to the first moiety, and 

a second component comprising a second low affinity reporter 
subunit coupled to the second moiety; 

wherein the first low affinity reporter subunit is capable of association with 
at least the second low affinity reporter subunit to generate a detectable signal, said 
association being mediated by association between the first and second moieties. 



wherein association between the first and second moieties is mediated by a third 
moiety; 

b) combining the first component, the second component and the third moiety; 
and 

c) detecting the presence or absence of the signal, 

58. The method of claim 57 wherein the association between the first and 
second moieties is mediated by multiple additional moieties. 

59. The reporter system of claim 1 , wherein the low-affinity reporter subunits 
comprise an enzyme and an inhibitor of the enzyme. 

60. The method of claim 39, wherein the substance directly or indirectly affects 
an upstream event which results in an effect on binding of the first and second binding 
moieties. 
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